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EG-VEGF NUCLEIC ACIDS AND POLYPEPTIDES AND METHODS OF USE 

BACKGROUND OF THE INVENTION 

Endothelial Cells 

5 The local microenvironment profoundly affects the phenotype and growth properties of vascular endothelial 

cells in a tissue- or organ-specific manner, but the nature of the local instructive signals is largely unknown. There is 
compelling evidence that the vascular endothelial growth factor (VEGF) and angiopoietin families of endothelial cell 
specific growth factors are essential for embryonic develcpment and for angiogenesis in a variety of physiological and 
pathological circumstances [Ferrara and Alitalo, Nature Medicine , 5:1359-1364 (1999); Carmeliet, Nature Medicine, 

10 6:389-395 (2000)]. There is also strong evidence for a local, tissue-specific, regulation of endothelial cell phenotype 

and growth [Aird et al., J. Cell Biol. , 138:1 1 1 7-1 124 (1997); Stewart and Wiley, Dev. Biol. , 84:183-192 (1981)]. The 
morphological and functional characteristics of endothelial cells vary extensively among different organs [Simionescu 
and Simionescu, Cell and Tissue Biology , Urban and Schwarzemberg, Baltimore, (1988) pp. 355-398]. Furthermore, 
the site of application determines the properties of new vessels to an even greater extent than the type of angiogenic 

15 factor tested [Dellian et al.. Am. J. Pathology . 149:59-71 (1996); Roberts et al., Am. J. Pathology , 153:1239-1248 

(1998)]. The molecular basis for this influence of the local microenvironment on the properties of the vasculature is 
unknown, but it is believed that the specialized stroma plays a major role [Dellian, supra ]. Conceivably, an integrated 
network of stimuli, which may include tissue-specific secreted proteins, in addition to cellular and extracellular matrix 
components, functions to determine the structure and function as well as modulate growth of the resident 

20 endothelium. 

Thus there is a current need to identify and characterize novel growth and differentiation factors that 
influence the growth and/or differentiation of endothelial cells. In addition to increasing our knowledge of the 
development of the vasculature, such compounds could be useful in the diagnosis and treatment of conditions 
associated with vascular tissue. 
25 Hormone Secreting Cells 

While there has been progress in the advancement of science and medical therapies, there is still a need for 
new treatments for the medical ailments of society. One approach to finding new treatments has been to study how 
the organism operates. In particular, of interest is how signaling cells control the behavior of the organism. For 
example, endocrine cells secrete signaling molecules called hormones wherein malfunctioning of secretion of these 
30 hormones can lead to a variety of disorders. 

Cells specialized for secretion of hormones include the cells of gonads, secreting testosterone (Leydig cell of 
testis), estrogen (theca interna cell of ovarian follicle) and progesterone (corpus luteum cell of ruptured ovarian 
follicle). While there are a variety of treatments in the medical field which utilize exogenous administration of 
testosterone, estrogen and progesterone, there remains a need to regulate the cells which produce these hormones. 
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Other cells specialized for secretion of hormones include the cells of the adrenal gland and the cells of the 
digestive system. For example, cells of the adrenal gland secrete epinephrine, norepinephrine and steroid hormones 
such as mineralocorticoids and glucocorticoids. Of particular interest is Cortisol which is produced in the cortex of the 
adrenal gland and which influences the metabolism of many cell types. Cells of the digestive system include those of 
5 the pancreas which secretes insulin. Insulin is secreted by the islets of Langerhans and is essential for the metabolism 

of carbohydrates. Insulin is used in the treatment and control of diabetes mellitus, however, there is still a need for 
efficient treatments for disorders such as diabetes. Other hormones of interest of the gut and respiratory tract 
include serotonin, endorphin, somatostatin, gastrin, secretin, cholecystokinin, glucagon and bombesin. 

There are numerous diseases and disorders associated with hormone secreting cells, in particular 
10 steroidogenic endothelial cells within endocrine glands. It would, therefore, be desirable to identify growth factors 

specifically affecting such endothelial cells. Such endothelial cell specific growth factors would be valuable tools for 
diagnosing and treating disorders associated with such cell types, and for identifying further drug candidates useful in 
diagnosis and treatment of such diseases. 

15 SUMMARY OF THE INVENTION 

The present invention is based on the identification and characterization of a novel, tissue-restricted, growth 
and differentiation factor that acts selectively on one endothelial cell type. This factor, referred to as endocrine gland- 
derived vascular endothelial growth factor (EG-VEGF), has been found to induce proliferation, migration and 
fenestrations in capillary endothelial cells derived from endocrine glands, but has shown no effect on a variety of other 

20 endothelial and non-endothelial cell types tested. As described in detail herein, EG-VEGF nucleic acids and 
polypeptides can be used in a number of assays and in diagnosis and treatment of conditions associated with hormone 
producing tissue. 

In one aspect, the present invention concerns a composition of matter, comprising an EG-VEGF polypeptide, 
or an agonist or antagonist of an EG-VEGF polypeptide, as herein described, in admixture with a pharmaceutically 

25 acceptable carrier. In one embodiment, this composition further comprises a vascular endothelial cell growth factor 

(VEGF) or an agonist or antagonist therof . 

In a further aspect, the present invention provides an article of manufacture comprising a container, a label 
on the container and a composition comprising an active agent that is contained within the container. The active 
agent is selected from the group consisting of an EG-VEGF polypeptide, an agonist of an EG-VEGF polypeptide and an 

30 antagonist of an EG-VEGF polypeptide. The label on the container indicates that the composition is effective for 

treating a condition that is associated with a hormone producing endothelial tissue. In one embodiment of this aspect 
of the invention, the condition is associated with steroidogenic endothelial cells within an endocrine gland. In a second 
embodiment, the tissue is ovarian, testicular, cervical, adrenal, placental or prostate tissue. The condition may 
preferably be infertility, polynistic ovary syndrome or cancer. 
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In another aspect, the present invention provides for a method for identifying a compound that binds to EG- 
VEGF. This method comprises contacting a candidate compound with EG-VEGF and determining whether the 
candidate compound binds to the EG-VEGF. In one embodiment, the assay is a competitive binding assay, i.e. the 
ability of the candidate compound to compete with a molecule known to bind EG-VEGF is measured. The assay may, 
5 for example, be a cell-based assay wherein the candidate compound is contacted with a whole cell or a cell membrane 

fraction expressing the coding sequence of EG-VEGF. 

The present invention also concerns a method for identifying a compound that modulates a biological activity 
of EG-VEGF. This method comprises the steps of contacting a candidate compound with EG-VEGF and determining if 
an alteration in a biological activity of EG-VEGF has occurred. In one embodiment, the compound inhibits a biological 

10 activity of EG-VEGF, and in another embodiment the compound enhances a biological activity of EG-VEGF. The 

biological activity may, for example, be the ability to induce phosphorylation of a kinase involved in cell proliferation or 
survival. In a preferred embodiment, the kinase is a MAP kinase, more preferably ERK1 or ERK2. In other 
embodiments, the biological activity is cell proliferation, induction of chemotaxis, angiogenesis, induction of cell 
differentiation or the induction of endothelial cell fenestration. Just as before, the assay may, for example, be a cell- 

1 5 based assay wherein the candidate molecule is contacted with a whole cell or a cell membrane fraction expressing the 

coding sequence of EG-VEGF. In a preferred embodiment, the cell is a recombinant host cell engineered to express EG- 
VEGF. Alternatively, the candidate molecule may be contacted with an isolated EG-VEGF. In a preferred embodiment, 
the EG-VEGF is immobilized on a solid support. 

In another aspect, the invention specifically covers compounds that have been identified by one of the 

20 assays described above as having the ability to bind EG-VEGF or modulate a biological activity of EG-VEGF. 

Additionally provided herein is a method of identifying a receptor for EG-VEGF. This method comprises 
combining EG-VEGF with a composition that comprises cell membrane material wherein the EG-VEGF complexes with a 
receptor on the cell memberane material and the receptor is identified as an EG-VEGF receptor. In one embodiment of 
this aspect of the invention, EG-VEGF binds to the receptor and EG-VEGF and the receptor are crosslinked. 

25 Another aspect of this invention concerns a method of stimulating a biological activity in cells or tissue. This 

includes a method of stimulating cell proliferation, a method of inducing chemotaxis in cells, a method of inducing 
angiogenesis in hormone producing tissue, a method of inducing cell differentiation in cells, and a method of inducing 
fenestration in cells. In each of these methods, the biological activity is stimulated by contacting the cells or tissue 
with EG-VEGF or an EG-VEGF agonist in an amount effective to induce the desired biological activity, or by introducing 

30 a nucleic acid encoding EG-VEGF or an EG-VEGF agonist into the cells or tissue in an amount effective to induce the 

biological activity. The cells preferably are endothelial cells, in particular hormone producing endothelial cells or 
tissue. 

The invention also provides for methods of inhibiting all of the biological activities described above by 
contacting the cells or hormone producing tissue with an EG-VEGF antagonist in an amount effective to inhibit the 
35 biological activity, or by introducing a nucleic acid encoding an EG-VEGF antagonist into the cells. 



WO 02/00711 



PCT/US01/20116 



An additional aspect of the invention provides for methods of treating individuals. These methods include a 
method for treating an individual for a condition associated with hormone producing tissue, a method for regulating 
fertility in an individual, a method of treating cancer in cells responsive to EG-VEGF in an individual, a method of 
treating cancer of the reproductive organs in an individual, and a method of treating an ovarian cyst in an individual. 
5 In one embodiment of the methods of this aspect of the invention, the methods comprise administering to the 

individual a composition comprising EG-VEGF or an agonist or antagonist thereof in an amount effective to treat the 
condition, regulate fertility, treat the cancer or treat the ovarian cyst. In another embodiment, the method comprises 
administering to the individual a composition comprising a nucleic acid encoding EG-VEGF or an agonist or antagonist 
thereof in an amount effective to treat the condition. In a particular embodiment, fertility is regulated by inhibiting 
10 follicle maturation. In a further embodiment, fertility is regulated by inhibiting ovulation. In another embodiment the 

individual has or is at risk of having polycystic ovary syndrome and fertility is regulated so as to maintain fertility. 
The cancer to be treated may, for example, be ovarian cancer, testicular cancer, prostate cancer, or uterine cancer. 
The preferred embodiments described below apply to all aspects of the invention. 
In one embodiment, EG-VEGF polypetide is a native sequence EG-VEGF. In a further embodiment the native 
15 sequence EG-VEGF is human. In another emodiment, the EG-VEGF is a fragment of a native sequence EG-VEGF. In still 

another embodiment, the EG-VEGF is an amino acid sequence variant of a native sequence EG-VEGF, which preferably 
• has at least about 85% sequence identity to the sequence of amino acid residues from about 1 or about 20 to about 
105, inclusive, of Figure 2 (SEQ ID NO: 2). In yet another embodiment, the amino acid sequence variant is a 
conservative substitution variant. In another embodiment the EG-VEGF is a fragment of an amino acid sequence 
20 variant of a native sequence EG-VEGF. In another embodiment the EG-VEGF is present as a fusion protein. 

The compositions and methods described above may further involve the use of a VEGF polypeptide. In one 
embodiment, the VEGF polypetide is a native sequence VEGF. In a further embodiment the native sequence VEGF is 
human. In another emodiment, the VEGF is a fragment of a native sequence VEGF. In still another embodiment, the 
VEGF is an amino acid sequence variant of a native sequence VEGF. In a further embodiment the amino acid sequence 
25 variant has at least about 85% sequence identity to the sequence of native sequence VEGF. In yet another 

embodiment, the amino acid sequence variant is a conservative substitution variant. In another embodiment the VEGF 
is a fragment of an amino acid sequence variant of a native sequence VEGF. In another embodiment the VEGF is 
present as a fusion protein. 

In one embodiment, the EG-VEGF agonist or VEGF agonist is an anti-EG-VEGF antibody or anti-VEGF antibody, 
30 respectively, specifically including antibody fragments. In another embodiment, the EG-VEGF or VEGF agonist is a 

small molecule. 

Similarly, the EG-VEGF or VEGF antagonist may, for example be an anti-EG-VEGF or anti-VEGF antibody, 
respectively, specifically including antibody fragments. In another embodiment the EG-VEGF or VEGF antagonist is a 
small molecule. 
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In one embodiment, the whole cell or cell membrane fraction expressing the coding sequence of EG-VEGF is a 
recombinant host cell engineered to express EG-VEGF. 

Other embodiments and variations of the present invention will become apparent by the description of the 
invention given in detail below. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows the nucleotide sequence (SEQ ID M0:1) of a cDNA containing a nucleotide sequence 
(nucleotides 91-405) encoding native sequence EG-VEGF, wherein the nucleotide sequence (SEQ ID N0:1) is a clone 
designated herein as cDNA60621-1516". Also presented in bold font and underlined are the positions of the 
10 respective start and stop codons. 

Figure 2 shows the amino acid sequence (SEQ ID N0:2) of a native sequence EG-VEGF polypeptide as derived 
from the coding sequence of SEQ ID N0:1. Also shown are the approximate locations of various other important 
polypeptide domains. 

Figures 3A-D show hypothetical exemplifications for using the below described method to determine % 
15 amino acid sequence identity (Figures 3A-B) and % nucleic acid sequence identity (Figures 3C-D) using the ALIGN-2 

sequence comparison computer program, wherein "PRO" represents the amino acid sequence of a hypothetical EG- 
VEGF polypeptide of interest, "Comparison Protein" represents the amino acid sequence of a polypeptide against 
which the "PRO" polypeptide of interest is being compared, "PRO-DNA" represents a hypothetical EG-VEGF-encoding 
nucleic acid sequence of interest, "Comparison DNA" represents the nucleotide sequence of a nucleic acid molecule 
20 against which the "PRO-DNA" nucleic acid molecule of interest is being compared, "X, "Y" and "Z" each represent 
different hypothetical amino acid residues and "N", "L" and "V" each represent different hypothetical nucleotides. 
Figure 4 shows a nucleotide sequence designated herein as DNA56748 (SEQ ID N0:3). 
Figures 5A-5F show EG-ECGF in situ expression in ovaries. Figures 5A and 5B show 2-year-old chimp ovary 
where Figure 5A shows a hematoxylin-eosin stain and Figure 5B shows expression using FITC. Figures 5C and 5D 
25 show cyno monkey ovary where Figure 5C shows a hematoxylin-eosin stain and Figure 5D shows EG-ECGF expression 

using FITC. Figures 5E and 5F show chimp stromal ovary where Figure 5E shows a hematoxylin-eosin stain and 
Figure 5F shows EG-ECGF expression using FITC. 

Figures 6A-D show actin and EG-ECGF in situ expression in frozen and paraffin-embedded human ovaries. 
Figure 6A shows a hematoxylin-eosin stain and Figure 6B shows EG-ECGF expression using FITC. Figure 6C shows a 
30 hematoxylin-eosin stain and Figure 6D shows EG-ECGF expression using FITC. 

Figure 7A shows an autoradiogram of EG-ECGF in situ expression in an ovary from a 46 year old woman 
with uterine prolapse and Figure 7B shows a hematoxylin-eosin stain of the same ovary. 

Figures 8A-D show EG-ECGF in situ expression in adrenal gland. Figures 8A and 8B show nucleic acid 
identification using DAPI and Figures 8C and 8D show protein identification using FITC. 
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Figure 9 In situ hybridization studies revealed that in the human testis, EG-VEGF transcript is restricted to 
the testosterone-producing Leydig cells (panels A-D). E, F) EG-VEGF signal is prevalent in the human ovarian stroma in 
cells that can be described as perivascular (K,L). A very similar pattern was evidenced in the chimpanzee ovary (data 
not shown). G,H) A strong signal is detected in the human corpus hemmorhagicum. In human (l,J) and chimpanzee 
5 (M,N,0,P) developing follicles, EG-VEGF RNA is expressed in the theca and granulosa cells ■ both steroidogenic cell 

types, and the cumulus oorphous. A,C,E,G,l,K,M,0 are bright-field images, and B,D,F,H,J,L,I\I,P the corresponding 
dark-field images. Arrows point to the course of a blood vessel. 

Figures 10A-C are a displacement plot and scatchard plot, respectively, showing 125 l-EG-VEGF-his ligand 
binding analysis in bovine adrenal cortical capillary endothelial cells (ACE). 
10 Figures 11A-B are a displacement plot and scatchard plot, respectively, showing 125 l-EG-VEGF-his ligand 

binding analysis in MS-1, a mouse endothelial cell line derived from the endocrine pancreas. Figure 11C shows EG- 
VEGF-his ligand binding analysis in human umbilical vein endothelial cells (HUVEC). 

Figures 12A-E show bar graphs indicating relative proliferation of cells using either a control, bFGF, VEGF, or 
EG-VEGF in the concentration of 1 nM, 10 nM or 25 nM. Figure 12A shows the results using pericytes. Figure 12B 
15 shows the results using human aortic vascular smooth muscle cells (HA-VSMC). Figure 12C shows the results using 

baby hamster kidney fibroblasts (BHK21). Figure 12D shows the results using ACE. Figure 12E shows the results 
using bovine brain capillary endothelial cells (BBC). 

Figure 13 shows that EG-VEGF is a mitogen and chemoattractant for specific endothelial cells. Panel a 
shows that in ACE cultures, EG-VEGF induced a maximal mitogenic response at 2 nM, with an ED 5D of 0.2 nM. Panel b 
20 shows the results of proliferation assays with several endothelial cell types: HUVEC, HMVEC, BBC, ABAE. Panel c 

shows the results of proliferation assays with non-endothelial cell types: human aortic vascular smooth muscle cells 
(VSMC), pericytes and fibroblasts (BHK21) and human neonatal fibroblasts -hFb) and keratinocytes. Basal media 
served as a negative control (Ct); bFGF (F), EGF (E), or VEGF (V), added respectively at 5.5 and 10 ng'ml, served as 
positive controls. EG-VEGF was tested at 1, 10, and 100 nM. Panel d shows that EG-VEGF induced a chemotactic 
25 response in ACE cells, primary baboon adrenal endothelial cells (BAEC) and MS-1, but not in HUVEC. Each graph is a 

representative experiment. Data are mean values with error bars indicating standard deviation, and the proliferation 
or migration index is relative to the negative control arbitrarily set to the value 1. Panels e-g illustrate that EG-VEGF 
induces fenestration in adrenal cortex-derived capillary endothelial cells. ACE, growth to confluence on ECM, were 
treated with 2.5 nM VEGF, 10 nM EG-VEGF, or the combination of the two factors. Electron micrographs of ACE cells 
30 untreated (panel e), treated with VEGF (panel f), or EG-VEGF (panel g) revealed that both molecules are capable of 
inducing f enestrae. Arrowheads indicate the locations of fenestrae. Magnification is indicated (e and f |im; g nm). 

Figures 14A and 14B show relative endothelial cell migration in a control, with VEGF, or EG-VEGF in a 
concentration of .2 nM, .5 nM, 1 nM or 5 nM. 

Figure 15 shows the phosphorylation of the MAP kinases Erk1 and 2 after exposure to EG-VEGF. 
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Figures 16A-C show EG-VEGF cDNA and amino acid sequences and alignments with homologous proteins. 
Figure 16A shows the 1.4kb human EG-VEGF cDIMA sequence. The 1.4 kb human EG-VEGF cDNA encodes a protein 
of 105 amino acids, with a classical signal sequence of 19 amino acids (underlined), and a mature protein of 86 amino 
acids. A striking feature of the primary protein structure is the cysteine content, 10 residues that potentially form 5 
5 disulfide bridges. Figure 16B shows an alignment of the sequences of human EG-VEGF, human Bv8 homologue (SEQ 

ID NO: 4), and snake VPRA (SEQ ID NO: 5). Boxed residues indicate identity. Figure 16C is an alignment of human 
EG-VEGF, human dickkopf-3 (hdkl<3, SEQ ID NO: 6), Xenopus dkk-1 (xdkkl, SEQ ID NO: 7) and porcine colipase (col, 
SEQ ID NO: 8). The alignment illustrates the conserved cysteines that form the characteristic disulfide-bonding 
pattern of this protein domain - the colipase fold. This motif in EG-VEGF is 37% identical and 41% homologous to the 

10 cysteine-rich C-terminal domain of human dkk-3; and 32% idential, 42% homologous to the Xenopus dkk-1 domain. 

Numbers indicate aa position in the respective protein and bozed residues are identical to EG-VEGF. 

Figures 17A-B illustrate hypoxic-regulation of EG-VEGF. Figure 17A shows Taqman analysis of RNA from 
SW13 (closed box) and H295 (open box) adrenal carcinoma cells exposed to normoxic (22% 0 2 ) versus hypoxic (2% OJ 
conditions for 18 hours revealed that EG-VEGF transcription increased 275% and 210% in SW13 and H295R, 

15 respectively, comparable to the 350% and 252% increases in VEGF over the normoxic controls. VEGF and EG-VEGF 

data was normalized to -actin. Figure 17B shows HRE lucif erase activities in normoxic (open box) versus hypoxic 
(closed box) conditions. Activities of luciferase reporters and vector were normalized to the co-transfected Renilla 
luciferase. Both the Epo consensus and EG-VEGF constructs were induced approximately 3.4-fold in hypoxia above 
their respective normoxic controls. Mutation of the core sequence, in z^omutant and EG-l/EGFmutant, abrogated the 

20 responsiveness to hypoxic conditions, resulting in activities similar to vector control. 

Figure 18 shows northern blot analysis of EG-VEGF expression. Northern blot analyses of human RNA 
samples revealed a single transcript of approximately 1 .4 kb. Expression is highest in ovary and testis, followed by 
adrenal and placenta. A less abundant signal, apparent after longer exposure, is present in prostate. Equivalent RNA 
loading was assessed by hybridization with the control actin probe (data not shown). Contents of the lanes are 

25 indicated above the blots, and the size (kb) is indicated at the right. 

Figure 1 9 illustrates the selectivity of in mo angiogenic effects of EG-VEGF. Panels a-c show the results of 
a rat corneal pocket assay. Note the strong angiogenic response induced by VEGF protein, while EG-VEGF has 
essentially no effect. Panels d-f show the results obtained by unjecting adCMV-lacZ, AdCMV-VEGF 164 , or AdCMV-EG- 
VEGF (5x1 0 a pfu) in the skeletal muscle (sm) of nude rats. Arrowheads point to microspheres marking the injection 

30 site, arrows point to new blood vessels. Note the angiogenic response, with abundant new vessel formation, induced 

by VEGF, while both lacZ and EG-VEGF Av had no appreciable effects. Panels g-h show the results of VEGF and EG- 
VEGF Av injection in the mouse ear. Again, VEGF Av resulted in a strong angiogenic response, which was absent in 
the animals injected with EG-VEGF Av (scale bar in d-h= 100 |j.m). Panel i shows the gross appearance of ovaries 
(ov) following injection of LacZ (ct) VEGF or EG-VEGF Av, after seven days (scale bar= 0.5 cm). Note the much larger 

35 mass plus the presence superficial vessels and hemorrhagic areas in both VEGF and EG-VEGF groups. Panels j-n are 
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micrographs of ovaries injected with Av vectors (5x1 0" pfu), as indicated. Note the normal architecture and 
morphology of the lacZ ovary in j. Corpora lutea (CL) are indicated. In contrast, the VEGF (k) and EG-VEGF (I) groups 
revealed very similar changes, with large fluid-filled or hemorrhagic cystic areas (*) (scale bar in j-l = 1 mm). Panels 
m-n are high power micrographs (scale bar = 33 \xm) of boxed areas in k and I, respectively. Areas of intense 
5 angiogenesis are evident at the periphery of cystic lesions. Arrows point toward blood vessels. 

Figures 20 A-Q provide the complete source code for the ALIGN-2 sequence comparison computer program. 
This source code may be routinely compiled for use on a UNIX operating system to provide the ALIGN-2 sequence 
comparison computer program. 

10 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

I. Definitions 

Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly 
understood by one of ordinary skill in the art to which this invention belongs. See, e.g. Singleton et a/., Dictionary of 
Microbiology and Molecular Biology 2nd ed., J. Wiley & Sons (New York, NY 1994); Sambrook et al., Molecular 

15 Cloning, A Laboratory Manual, Cold Springs Harbor Press (Cold Springs Harbor, NY 1989). For purposes of the 

present invention, the following terms are defined below. 

The terms "EG-VEGF polypeptide", "EG-VEGF protein" and "EG-VEGF" (and also "VRPA polypeptide", "VRPA 
protein" and "VRPA", terms used earlier to describe the same polypeptides) when used herein encompass native 
sequence EG-VEGF and EG-VEGF polypeptide variants (which are further defined herein). The EG-VEGF polypeptide 

20 may be isolated from a variety of sources, such as from human tissue types or from another source, or prepared by 

recombinant and/or synthetic methods. 

A "native sequence EG-VEGF" comprises a polypeptide having the same amino acid sequence as an EG-VEGF 
derived from nature. Such native sequence EG-VEGF can be isolated from nature or can be produced by recombinant 
and/or synthetic means. The term "native sequence EG-VEGF" specifically encompasses naturally-occurring truncated 

25 or secreted forms (e.g., an extracellular domain sequence), naturally-occurring variant forms (e.g., alternatively spliced 

forms) and naturally-occurring allelic variants of the EG-VEGF. In one embodiment of the invention, the native 
sequence EG-VEGF is a mature or full-length native sequence EG-VEGF comprising amino acids 1 to 105 of Figure 2 
(SEQ ID N0:2). Also, while the EG-VEGF polypeptide disclosed in Figure 2 (SEQ ID N0:2) is shown to begin with the 
methionine residue designated herein as amino acid position 1, it is conceivable and possible that another methionine 

30 residue located either upstream or downstream from amino acid position 1 in Figure 2 (SEQ ID N0:2) may be employed 

as the starting amino acid residue for the EG-VEGF polypeptide. 

"EG-VEGF variant polypeptide" means an active EG-VEGF polypeptide as defined below having at least about 
80% amino acid sequence identity with the amino acid sequence of (a) residues 1 or about 20 to 105 of the EG-VEGF 
polypeptide shown in Figure 2 (SEQ ID N0:2), (b) X to 105 of the EG-VEGF polypeptide shown in Figure 2 (SEQ ID 

35 N0:2), wherein X is any amino acid residue from 14 to 24 of Figure 2 (SEQ ID N0:2), or (c) another specifically derived 
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fragment of the amino acid sequence shown in Figure 2 (SED ID N0:2]. Such EG-VEGF variant polypeptides include, 
for instance, EG-VEGF polypeptides wherein one or more amino acid residues are added, or deleted, at the l\l- and/or C- 
terminus, as well as within one or more internal domains, of the sequence of Figure 2 (SEQ ID N0:2). Ordinarily, a EG- 
VEGF variant polypeptide will have at least about 80% amino acid sequence identity, more preferably at least about 
5 81% amino acid sequence identity, more preferably at least about 82% amino acid sequence identity, more preferably 

at least about 83% amino acid sequence identity, more preferably at least about 84% amino acid sequence identity, 
more preferably at least about 85% amino acid sequence identity, more preferably at least about 86% amino acid 
sequence identity, more preferably at least about 87% amino acid sequence identity, more preferably at least about 
88% amino acid sequence identity, more preferably at least about 89% amino acid sequence identity, more preferably 

10 at least about 90% amino acid sequence identity, more preferably at least about 91% amino acid sequence identity, 

more preferably at least about 92% amino acid sequence identity, more preferably at least about 93% amino acid 
sequence identity, more preferably at least about 94% amino acid sequence identity, more preferably at least about 
95% amino acid sequence identity, more preferably at least about 96% amino acid sequence identity, more preferably 
at least about 97% amino acid sequence identity, more preferably at least about 98% amino acid sequence identity 

15 and yet more preferably at least about 99% amino acid sequence identity with (a) residues 1 or about 20 to 105 of 

the EG-VEGF polypeptide shown in Figure 2 (SEQ ID NQ:2), (b) X to 105 of the EG-VEGF polypeptide shown in Figure 2 
(SEQ ID N0:2), wherein X is any amino acid residue from 14 to 24 of Figure 2 (SEQ ID ND:2), or (c) another 
specifically derived fragment of the amino acid sequence shown in Figure 2 (SEQ ID N0:2). EG-VEGF variant 
polypeptides do not encompass the native EG-VEGF polypeptide sequence. Ordinarily, EG-VEGF variant polypeptides 

20 are at least about 10 amino acids in length, often at least about 20 amino acids in length, more often at least about 

30 amino acids in length, more often at least about 40 amino acids in length, more often at least about 50 amino acids 
in length, more often at least about 60 amino acids in length, more often at least about 70 amino acids in length, more 
often at least about 80 amino acids in length, more often at least about 90 amino acids in length, more often at least 
about 100 amino acids in length, more often at least about 150 amino acids in length, more often at least about 200 

25 amino acids in length, more often at least about 250 amino acids in length, more often at least about 300 amino acids 

in length, or more. 

"Percent (%] amino acid sequence identity" with respect to the EG-VEGF polypeptide sequences identified 
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino 
acid residues in a EG-VEGF sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the 

30 maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence 

identity. Alignment for purposes of determining percent amino acid sequence identity can be achieved in various ways 
that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, 
ALIGN, ALIGN-2 or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for 
measuring alignment, including any algorithms needed to achieve maximal alignment over the full-length of the 

35 sequences being compared. For purposes herein, however, % amino acid sequence identity values are obtained as 
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described below by using the sequence comparison computer program ALIGN-2, wherein the complete source code for 
the ALIGN-2 program is provided in Figure 20. The ALIGN-2 sequence comparison computer program was authored by 
Genentech, Inc. and the source code shown in Figure 20 has been filed with user documentation in the U.S. Copyright 
Office, Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. TXU510087. The 
5 ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, California or may be compiled 
from the source code provided herein. The ALIGN-2 program should be compiled for use on a UNIX operating system, 
preferably digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary. 

For purposes herein, the % amino acid sequence identity of a given amino acid sequence A to, with, or 
against a given amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A that has 
10 or comprises a certain % amino acid sequence identity to, with, or against a given amino acid sequence B) is 

calculated as follows: 

100 times the fraction X/Y 

15 where X is the number of amino acid residues scored as identical matches by the sequence alignment program ALIGN- 

2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. It will be 
appreciated that where the length of amino acid sequence A is not equal to the length of amino acid sequence B, the 
% amino acid sequence identity of A to B will not equal the % amino acid sequence identity of B to A. As examples of 
% amino acid sequence identity calculations, Figures 3A-B demonstrate how to calculate the % amino acid sequence 

20 identity of the amino acid sequence designated "Comparison Protein" to the amino acid sequence designated "PRO". 

Unless specifically stated otherwise, all % amino acid sequence identity values used herein are obtained as 
described above using the ALIGN-2 sequence comparison computer program. However, % amino acid sequence 
identity may also be determined using the sequence comparison program NCBI-BLAST2 (Altschul et al.. Nucleic Acids 
Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence comparison program may be downloaded from 

25 http:/|www.ncbi.nlm.nih.gov. NCBI-BLAST2 uses several search parameters, wherein all of those search parameters 

are set to default values including, for example, unmask = yes, strand = all, expected occurrences - 10, minimum 
low complexity length = 15/5, multi-pass e-value = 0.01, constant for multi-pass = 25, dropoff for final gapped 
alignment = 25 and scoring matrix = BL0SUM62. 

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the % amino acid 

30 sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which can 

alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid sequence 
identity to, with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

35 
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where X is the number of amino acid residues scored as identical matches by the sequence alignment program NCBI- 
BLAST2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. It will 
be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid sequence B, 
the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of B to A. 
5 "EG-VEGF variant polynucleotide" or "EG-VEGF variant nucleic acid sequence" means a nucleic acid molecule 

which encodes an active EG-VEGF polypeptide as defined below and which has at least about 80% nucleic acid 
sequence identity with either (a) a nucleic acid sequence which encodes residues 1 or about 20 to 105 of the EG-VEGF 
polypeptide shown in Figure 2 (SEQ ID N0:2), (b) a nucleic acid sequence which encodes amino acids X to 105 of the 
EG-VEGF polypeptide shown in Figure 2 (SEQ ID l\IO:2), wherein X is any amino acid residue from 14 to 24 of Figure 2 

10 (SEQ ID N0:2), or (c) a nucleic acid sequence which encodes another specifically derived fragment of the amino acid 

sequence shown in Figure 2 (SEQ ID I\I0:2). Ordinarily, a EG-VEGF variant polynucleotide will have at least about 80% 
nucleic acid sequence identity, more preferably at least about 81% nucleic acid sequence identity, more preferably at 
least about 82% nucleic acid sequence identity, more preferably at least about 83% nucleic acid sequence identity, 
more preferably at least about 84% nucleic acid sequence identity, more preferably at least about 85% nucleic acid 

15 sequence identity, more preferably at least about 86% nucleic acid sequence identity, more preferably at least about 

87% nucleic acid sequence identity, more preferably at least about 88% nucleic acid sequence identity, more 
preferably at least about 89% nucleic acid sequence identity, more preferably at least about 90% nucleic acid 
sequence identity, more preferably at least about 91% nucleic acid sequence identity, more preferably at least ahout 
92% nucleic acid sequence identity, more preferably at least about 93% nucleic acid sequence identity, more 

20 preferably at least about 94% nucleic acid sequence identity, more preferably at least about 95% nucleic acid 
sequence identity, more preferably at least about 96% nucleic acid sequence identity, more preferably at least about 
97% nucleic acid sequence identity, more preferably at least about 98% nucleic acid sequence identity and yet more 
preferably at least about 99% nucleic acid sequence identity with either (a) a nucleic acid sequence which encodes 
residues 1 or about 20 to 105 of the EG-VEGF polypeptide shown in Figure 2 (SEQ ID N0:2), (b) a nucleic acid 

25 sequence which encodes amino acids X to 1 05 of the EG-VEGF polypeptide shown in Figure 2 (SEQ ID I\I0:2), wherein 

X is any amino acid residue from 14 to 24 of Figure 2 (SEQ ID N0:2), or (c) a nucleic acid sequence which encodes 
another specifically derived fragment of the amino acid sequence shown in Figure 2 (SEQ ID I\I0:2). EG-VEGF 
polynucleotide variants do not encompass the native EG-VEGF nucleotide sequence. 

Ordinarily, EG-VEGF variant polynucleotides are at least about 30 nucleotides in length, often at least about 

30 60 nucleotides in length, more often at least about 90 nucleotides in length, more often at least about 1 20 nucleotides 

in length, more often at least about 150 nucleotides in length, more often at least about 180 nucleotides in length, 
more often at least about 210 nucleotides in length, more often at least about 240 nucleotides in length, more often 
at least about 270 nucleotides in length, more often at least about 300 nucleotides in length, more often at least 
about 450 nucleotides in length, more often at least about 600 nucleotides in length, more often at least about 900 

35 nucleotides in length, or more. 
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"Percent (%) nucleic acid sequence identity" with respect to the EG-VEGF polypeptide-encoding nucleic acid 
sequences identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with 
the nucleotides in a EG-VEGF polypeptide-encoding nucleic acid sequence, after aligning the sequences and introducing 
gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent 
5 nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, using 
publicly available computer software such as BLAST, BLAST-2, ALIGN, ALIGN-2 or Megalign (DNASTAR) software. 
Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms 
needed to achieve maximal alignment over the full-length of the sequences being compared. For purposes herein, 
however, % nucleic acid sequence identity values are obtained as described below by using the sequence comparison 

10 computer program ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided below. The 

ALIGN-2 sequence comparison computer program was authored by Genentech, Inc. and the source code shown below 
has been filed with user documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered 
under U.S. Copyright Registration No. TXU5 10087. The ALIGN-2 program is publicly available through Genentech, 
Inc., South San Francisco, California or may be compiled from the source code provided below. The ALIGN-2 program 

15 should be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison 

parameters are set by the ALIGN-2 program and do not vary. 

For purposes herein, the % nucleic acid sequence identity of a given nucleic acid sequence C to, with, or 
against a given nucleic acid sequence D (which can alternatively be phrased as a given nucleic acid sequence C that 
has or comprises a certain % nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is 

20 calculated as follows: 

100 times the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN-2 in that 
25 program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be appreciated that 

where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence D, the % nucleic acid 
sequence identity of C to D will not equal the % nucleic acid sequence identity of D to C. As examples of % nucleic 
acid sequence identity calculations, Figures 3C-0 demonstrate how to calculate the % nucleic acid sequence identity 
of the nucleic acid sequence designated "Comparison DNA" to the nucleic acid sequence designated "PRO-DNA". 
30 Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are obtained as 

described above using the ALIGN-2 sequence comparison computer program. However, % nucleic acid sequence 
identity may also be determined using the sequence comparison program NCBI-BLAST2 (Altschul et a!., Nucleic Acids 
Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence comparison program may be downloaded from 
http://www.ncbi.nlm.nih.gov. NCBI-BLAST2 uses several search parameters, wherein all of those search parameters 
35 are set to default values including, for example, unmask = yes, strand = all, expected occurrences = 10, minimum 
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low complexity length = 15/5, multi-pass e-value = 0.01, constant for multi-pass = 25, dropoff for final gapped 
alignment = 25 and scoring matrix = BL0SUM62. 

In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid sequence 
identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which can 
5 alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid sequence 

identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

100 times the fraction W/Z 

10 where W is the number of nucleotides scored as identical matches by the sequence alignment program NCBI-BLAST2 

in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be appreciated 
that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence D, the % nucleic 
acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to C. 

In other embodiments, EG-VEGF variant polynucleotides are nucleic acid molecules that encode an active EG- 

15 VEGF polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash conditions, 

to nucleotide sequences encoding the full-length EG-VEGF polypeptide shown in Figure 2 (SEQ ID l\IO:2). EG-VEGF 
variant polypeptides may be those that are encoded by a EG-VEGF variant polynucleotide. 

The term "positives", in the context of the amino acid sequence identity comparisons performed as described 
above, includes amino acid residues in the sequences compared that are not only identical, but also those that have 

20 similar properties. Amino acid residues that score a positive value to an amino acid residue of interest are those that 

are either identical to the amino acid residue of interest or are a preferred substitution (as defined in Table 1 below) of 
the amino acid residue of interest. 

For purposes herein, the % value of positives of a given amino acid sequence A to, with, or against a given 
amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A that has or comprises a 

25 certain % positives to, with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scoring a positive value as defined above by the sequence alignment 
30 program ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. 

It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid 

sequence B, the % positives of A to B will not equal the % positives of B to A. 

"Isolated," when used to describe the various polypeptides disclosed herein, means polypeptide that has 

been identified and separated and/or recovered from a component of its natural environment. Preferably, the isolated 
35 polypeptide is free of association with all components with which it is naturally associated. Contaminant components 
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of its natural environment are materials that would typically interfere with diagnostic or therapeutic uses for the 
polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous solutes. In preferred 
embodiments, the polypeptide will be purified (1) to a degree sufficient to obtain at least 15 residues of N-terminal or 
internal amino acid sequence by use of a spinning cup sequenator, or (2) to homogeneity by SDS-PAGE under non- 
5 reducing or reducing conditions using Coomassie blue or, preferably, silver stain. Isolated polypeptide includes 

polypeptide in situ within recombinant cells, since at least one component of the EG-VEGF natural environment will not 
be present. Ordinarily, however, isolated polypeptide will be prepared by at least one purification step. 

An "isolated" nucleic acid molecule encoding a EG-VEGF polypeptide is a nucleic acid molecule that is 
identified and separated from at least one contaminant nucleic acid molecule with which it is ordinarily associated in 

10 the natural source of the EG-VEGF-encoding nucleic acid. Preferably, the isolated nucleic is free of association with all 
components with which it is naturally associated. An isolated EG-VEGF-encoding nucleic acid molecule is other than in 
the form or setting in which it is found in nature. Isolated nucleic acid molecules therefore are distinguished from the 
EG-VEGF-encoding nucleic acid molecule as it exists in natural cells. However, an isolated nucleic acid molecule 
encoding a EG-VEGF polypeptide includes EG-VEGF-encoding nucleic acid molecules contained in cells that ordinarily 

15 express EG-VEGF where, for example, the nucleic acid molecule is in a chromosomal location different from that of 

natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked 
coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, 
include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known to utilize 

20 promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid 
sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide if it is 
expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably 
linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked 

25 to a coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" means that the DNA 

sequences being linked are contiguous, and, in the case of a secretory leader, contiguous and in reading phase. 
However, enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient restriction sites. 
If such sites do not exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with conventional 
practice. 

30 The term "antibody" is used in the broadest sense and specifically covers, for example, single anti-EG-VEGF 

monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-EG-VEGF antibody compositions 
with polyepitopic specificity, single chain anti-EG-VEGF antibodies, and fragments of anti-EG-VEGF antibodies (see 
below). The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical except for 

3 5 possible naturally-occurring mutations that may be present in minor amounts. 
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"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 
generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. In 
general, longer probes require higher temperatures for proper annealing, while shorter probes need lower 
temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when complementary 
5 strands are present in an environment below their melting temperature. The higher the degree of desired homology 

between the probe and hybridizable sequence, the higher the relative temperature, which can be used. As a result, it 
follows that higher relative temperatures would tend to make the reaction conditions more stringent, while lower 
temperatures less so. For additional details and explanation of stringency of hybridization reactions, see Ausubel et 
ai, Current Protocols in Molecular Biology , Wiley Interscience Publishers, (1995). 

10 "Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those that: (1) 

employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium 
citrate/0. 1% sodium dodecyl sulfate at GO°C; (2) employ during hybridization a denaturing agent, such as formamide, 
for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1 % Ficoll/0.1% polyvinylpyrrolidone/50mM 
sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42 D C; or (3) employ 50% 

15 formamide, 5 x SSC (0.75 M IMaCI, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium 

pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DMA (50 g/ml), 0.1% SDS, and 10% dextran sulfate 
at 42°C, with washes at 42°C in 0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide at 55 a C, followed by 
a high-stringency wash consisting of 0.1 x SSC containing EDTA at 55°C. 

"Moderately stringent conditions" may be identified as described by Sambrook et ai, Molecular Cloning: A 

20 Laboratory Manual , New York: Cold Spring Harbor Press, 1989, and include the use of washing solution and 

hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those described above. An 
example of moderately stringent conditions is overnight incubation at 37°C in a solution comprising: 20% formamide, 
5 x SSC (150 mM IMaCI, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5 x Denhardt's solution, 10% 
dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, followed by washing the filters in 1 x SSC at 

25 about 37-50 D C. The skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to 

accommodate factors such as probe length and the like. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a EG-VEGF 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope against 
which an antibody can be made, yet is short enough such that it does not interfere with activity of the polypeptide to 

30 which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does not substantially 

cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino acid residues and usually 
between about 8 and 50 amino acid residues (preferably, between about 10 and 20 amino acid residues). 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the binding 
specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin constant domains. 

35 Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the desired binding specificity 

-15- 



WO 02/00711 



PCT/US01/20116 



which is other than the antigen recognition and binding site of an antibody (i.e., is "heterologous"), and an 
immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin molecule typically is a contiguous 
amino acid sequence comprising at least the binding site of a receptor or a ligand. The immunoglobulin constant 
domain sequence in the immunoadhesin may be obtained from any immunoglobulin, such as lgG-1, lgG-2, lgG-3, or IgG- 
5 4 subtypes, IgA (including lgA-1 and lgA-2), IgE, IgD or IgM. 

"Active" or "activity" for the purposes herein refers to form(s) of EG-VEGF which retain a biological and/or 
an immunological activity of native or naturally-occurring EG-VEGF, wherein "biological" activity refers to a biological 
function (either inhibitory or stimulatory) caused by a native or naturally-occurring EG-VEGF other than the ability to 
induce the production of an antibody against an antigenic epitope possessed by a native or naturally-occurring EG- 

10 VEGF and an "immunological" activity refers to the ability to induce the production of an antibody against an antigenic 
epitope possessed by a native or naturally-occurring EG-VEGF. Biological activities specifically include regulation of 
cellular proliferation and chemotaxis. A preferred biologicai activity is the ability to regulate the production, growth, 
differentiation and/or migration of endothelial cells. Even more preferably, the biological activity is the ability to 
induce proliferation, migration and/or fenestrations in capillary endothelial cells, preferably steroidogenic cells, within 

15 endocrine glands. Another preferred biological activity is the ability to promote angiogenesis and/or regulate vascular 
permeability in endocrine glands, especially in steroidogenic tissues within endocrine glands. Yet another preferred 
biological activity is the ability to induce phosphorylation of a signaling molecule involved in cell proliferation and/or 
survival, such as MAP kinase, e.g. ERK1 or ERK2. 

The term "angiogenesis" is used in the broadest sense and specifically includes the creation of new blood 

20 vessels from existing vessels, just as the de novo assembly of endothelial progenitor cells into vessels through 

migration, proliferation, cell-cell aggregation, assembly and morphogenesis (also referred to as "vasculogenesis"). 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully blocks, 
inhibits, or neutralizes a biological activity of a native EG-VEGF polypeptide disclosed herein. In a similar manner, the 
term "agonist" is used in the broadest sense and includes any molecule that mimics a biological activity of a native EG- 

25 VEGF polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically include agonist or antagonist 

antibodies or antibody fragments, fragments or amino acid sequence variants of native EG-VEGF polypeptides, 
peptides, small organic molecules, etc. Methods for identifying agonists or antagonists of a EG-VEGF polypeptide 
may comprise contacting a EG-VEGF polypeptide with a candidate agonist or antagonist molecule and measuring a 
detectable change in one or more biological activities normally associated with the EG-VEGF polypeptide. 

30 "Treatment" refers to both therapeutic treatment and prophylactic or preventative measures, wherein the 

object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those in need of treatment 
include those already with the disorder as well as those prone to have the disorder or those in whom the disorder is to 
be prevented. For example, in tumor (e.g., cancer) treatment, a therapeutic agent may directly decrease the 
pathology of tumor cells, or render the tumor cells more susceptible to treatment by other therapeutic agents, e.g., 

35 radiation and/or chemotherapy. 
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"Steroidogenesis" is the hormonally induced, CAMP-mediated acute regulation of steroid hormone 
biosynthesis in "steroidogenic cells" characterized by the mobilization of cholesterol from cellular stores to the 
mitochondria outer membrane, and its translocation to the inner membrane where the conversion of cholesterol to 
pregnenolone occurs. 
5 "Steroidogenic tissue" refers to tissue which produces steroidal hormones by the process of steroidogenesis. 

Examples include tissues of the adrenal gland, the reproductive organs, gut and respiratory tract tissue. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to an 
acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. "Intermittent" 
administration is treatment that is not consecutively done without interruption, but rather is cyclic in nature. 

10 "Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans, other 

higher primates, domestic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, cattle, horses, sheep, 
pigs, goats, rabbits, etc. Preferably, the mammal is human. 

"Tumor", as used herein, refers to all neoplastic cell growth and proliferation, whether malignant or benign, 
and all pre-cancerous and cancerous cells and tissues. 

15 The terms "cancer" and "cancerous" refer to or describe the physiological condition in mammals that is 

typically characterized by unregulated cell growth. Examples of cancers of particular interest herein include cancers 
of the reproductive organs, e.g. ovarian cancer, testicular cancer, uterine cancer, cervical cancer; prostate cancer; 
cancers of the adrenal gland, including cancers of the adrenal cortex (e.g. adrenocortical carcinoma) and the adrenal 
medulla; thyroid cancer; parathyroid cancer; pancreatic cancer; and endometrial carcinoma. 

20 The "pathology" of a disease includes all phenomena that compromise the well-being of the patient. For 

cancer, this includes, without limitation, abnormal or uncontrollable cell growth, metastasis, interference with the 
normal functioning of neighboring cells, release of cytokines or other secretory products at abnormal levels, 
suppression or aggravation of inflammatory or immunological response, etc. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous 

25 (concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers which are 
nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often the 
physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically acceptable carriers 
include buffers such as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid; low molecular 

30 weight (less than about 10 residues) polypeptide; proteins, such as serum albumin, gelatin, or immunoglobulins; 

hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or 
lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating 
agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt-forming counterions such as sodium; and/or 
nonionic surfactants such as TWEEN™, polyethylene glycol (PEG), and PLUR01MICS™. 
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"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or variable 
region of the intact antibody. Examples of antibody fragments include Fab, Fab", F(ab'} 2 , and Fv fragments; diabodies; 
linear antibodies (Zapata et al.. Protein Eng. 8{10): 1057-1062 [1995]); single-chain antibody molecules; and 
multispecific antibodies formed from antibody fragments. 
5 Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" fragments, 

each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the ability to crystallize 
readily. Pepsin treatment yields an F(ab') 2 fragment that has two antigen-combining sites and is still capable of cross- 
linking antigen. 

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding site. 

10 This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non-covalent association. It 
is in this configuration that the three CDRs of each variable domain interact to define an antigen-binding site on the 
surface of the VH-VL dimer. Collectively, the six CDRs confer antigen-binding specificity to the antibody. However, 
even a single variable domain (or half of an Fv comprising only three CDRs specific for an antigen) has the ability to 
recognize and bind antigen, although at a lower affinity than the entire binding site. 

15 The Fab fragment also contains the constant domain of the light chain and the first constant domain (CH1) 

of the heavy chain. Fab fragments differ from Fab' fragments by the addition of a few residues at the carboxy 
terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge region. Fab'-SH is 
the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear a free thiol group. F(ab') 2 
antibody fragments originally were produced as pairs of Fab' fragments which have hinge cysteines between them. 

20 Other chemical couplings of antibody fragments are also known. 

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to one of 
two clearly distinct types, called kappa and lambda, based on the amino acid sequences of their constant domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins can be 
assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and 

25 several of these may be further divided into subclasses (isotypes), e.g„ IgGI, lgG2, lgG3, lgG4, IgA, and lgA2. 

"Single-chain Fv" or "sFv" antibody fragments comprise the VH and VL domains of antibody, wherein these 
domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a polypeptide 
linker between the VH and VL domains which enables the sFv to form the desired structure for antigen binding. For a 
review of sFv, see Pluckthun in The Pharmacology of Monoclonal Antibodies , vol. 113, Rosenburg and Moore eds., 

30 Springer-Verlag, New York, pp. 269-315 (1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which fragments 
comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) in the same polypeptide 
chain (VH - VL). By using a linker that is too short to allow pairing between the two domains on the same chain, the 
domains are forced to pair with the complementary domains of another chain and create two antigen-binding sites. 



-18- 



WO 02/00711 



PCT/US01/20116 



Diabodies are described more fully in, for example, EP 404,097; WO 93(1 1161; and Hollinger et al., Proc. Natl. Acad. 
Sci. USA, 90:6444-6448(1993). 

An "isolated" antibody is one which has been identified and separated and/or recovered from a component of 
its natural environment. Contaminant components of its natural environment are materials which would interfere with 
5 diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and other proteinaceous or 

nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) to greater than 95% by weight 
of antibody as determined by the Lowry method, and most preferably more than 99% by weight, (2) to a degree 
sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup 
sequenator, or (3) to homogeneity by SDS-PAGE under reducing or nonreducing conditions using Coomassie blue or, 
10 preferably, silver stain. Isolated antibody includes the antibody in situ within recombinant cells since at least one 
component of the antibody's natural environment will not be present. Ordinarily, however, isolated antibody will be 
prepared by at least one purification step. 

The word "label" when used herein refers to a detectable compound or composition which is conjugated 
directly or indirectly to the antibody so as to generate a "labeled" antibody. The label may be detectable by itself (e.g. 
15 radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical alteration of a 

substrate compound or composition which is detectable. 

By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can adhere. 
Examples of solid phases encompassed herein include those formed partially or entirely of glass (e.g., controlled pore 
glass), polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and silicones. In certain 
20 embodiments, depending on the context, the solid phase can comprise the well of an assay plate; in others it is a 
purification column (e.g., an affinity chromatography column). This term also includes a discontinuous solid phase of 
discrete particles, such as those described in U.S. Patent No. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant which is 
useful for delivery of a drug (such as a EG-VEGF polypeptide or antibody thereto) to a mammal. The components of 
25 the liposome are commonly arranged in a hilayer formation, similar to the lipid arrangement of biological membranes. 

A "small molecule" is defined herein to have a molecular weight below about 500 Daltons. 
The terms "vascular endothelial growth factor", "VEGF", "VEGF polypeptide" and "VEGF protein" when used 
herein encompass native sequence VEGF and VEGF variants (which are further defined herein). The VEGF polypeptide 
may be isolated from a variety of sources, such as from human tissue types or from another source, or prepared by 
30 recombinant and/or synthetic methods. 

A "native sequence VEGF" comprises a polypeptide having the same amino acid sequence as a VEGF derived 
from nature. Such native sequence VEGF can be isolated from nature or can be produced by recombinant and/or 
synthetic means. The term "native sequence VEGF" specifically encompasses naturally-occurring truncated or 
secreted forms (e.g., an extracellular domain sequence), naturally-occurring variant forms \e.g., alternatively spliced 
35 forms) and naturally-occurring allelic variants of the VEGF. In one embodiment of the invention, the native sequence 
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VEGF is one of the five known isoforms, consisting of 121, 145, 165, 189, and 206 amino acid residues, respectively, 
as described, for example in U.S. Patent Nos. 5,332,671 and 5,240,848; in PCT Publication No. WO 98/10071; 
Leung et aL Science 246:1306-1309 (1989); and Keck et al.. Science 246: 1309-1312 (1989). 

"VEGF variant polypeptide" means an active VEGF polypeptide as defined below having at least about 80%, 
5 preferably at least about 85%, more preferably at least about 90%, event more preferably at least about 95%, most 

preferably at least about 98% amino acid sequence identity with the amino acid sequence of a native sequence VEGF. 
Such VEGF variant polypeptides include, for instance, VEGF polypeptides wherein one or more amino acid residues are 
added, or deleted, at the and/or C-terminus, as well as within one or more internal domains, of the native sequence. 

The sequence identity (either amino acid or nucleic acid) for VEGF is determined using the same approach 
10 specifically described with regard to EG-VEGF. Similarly, the definitions provided for agonist and antagonists of EG- 
VEGF, including but not limited to antibodies, will apply to VEGF agonists and antagonists. 

II. Compositions and Methods of the Invention 

A. Full-length EG-VEGF Polypeptide 

15 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as EG-VEGF (or also UN0600). In particular, cDNA encoding a EG-VEGF 
polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. It is noted that 
proteins produced in separate expression rounds may be given different PRO numbers but the UNO number is unique 
for any given DMA and the encoded protein, and will not be changed. However, for sake of simplicity, in the present 

20 specification the protein encoded by DNA60621-1 516 as well as all further native homologues and variants included 

in the foregoing definition of EG-VEGF, will be referred to as "EG-VEGF", regardless of their origin or mode of 
preparation. 

As disclosed in the Examples below, a cDNA clone designated herein as DNA6062M516 has been 
deposited with the ATCC. The actual nucleotide sequence of the clone can readily be determined by the skilled artisan 

25 by sequencing of the deposited clone using routine methods in the art. The predicted amino acid sequence can be 

determined from the nucleotide sequence using routine skill. For the EG-VEGF polypeptide and encoding nucleic acid 
described herein, Applicants have identified what is believed to be the reading frame best identifiable with the 
sequence information available at the time. 

Using the ALIGN-2 sequence alignment computer program referenced above, it has been found that the full- 

30 length native sequence EG-VEGF (shown in Figure 2 and SEQ ID N0:2) has certain amino acid sequence identity with 

EG-VEGFJ3ENP0, a protein from black mambo venom. However, no significant sequence identities to any known 
mammalian proteins were revealed. 
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In addition to the full-length native sequence EG-VEGF polypeptides described herein, it is contemplated that 
EG-VEGF variants can be prepared. EG-VEGF variants can be prepared by introducing appropriate nucleotide changes 
into the EG-VEGF DNA, and/or by synthesis of the desired EG-VEGF polypeptide. Those skilled in the art will 
5 appreciate that amino acid changes may alter post-translational processes of the EG-VEGF, such as changing the 

number or position of glycosylation sites or altering the membrane anchoring characteristics. 

Variations in the native full-length sequence EG-VEGF or in various domains of the EG-VEGF described herein, 
can be made, for example, using any of the techniques and guidelines for conservative and non-conservative mutations 
set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or insertion of one or 

10 more codons encoding the EG-VEGF that results in a change in the amino acid sequence of the EG-VEGF as compared 

with the native sequence EG-VEGF. Optionally the variation is by substitution of at least one amino acid with any 
other amino acid in one or more of the domains of the EG-VEGF. Guidance in determining which amino acid residue 
may be inserted, substituted or deleted without adversely affecting the desired activity may be found by comparing 
the sequence of the EG-VEGF with that of homologous known protein molecules and minimizing the number of amino 

15 acid sequence changes made in regions of high homology. Amino acid substitutions can be the result of replacing one 

amino acid with another amino acid having similar structural and/or chemical properties, such as the replacement of a 
leucine with a serine, i.e., conservative amino acid replacements. Insertions or deletions may optionally be in the 
range of about 1 to 5 amino acids. The variation allowed may be determined by systematically making insertions, 
deletions or substitutions of amino acids in the sequence and testing the resulting variants for activity exhibited by the 

20 full-length or mature native sequence. 

EG-VEGF polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus or 
C-terminus, or may lack internal residues, for example, when compared with a full length native protein. Certain 
fragments lack amino acid residues that are not essential for a desired biological activity of the EG-VEGF polypeptide. 
EG-VEGF fragments may be prepared by any of a number of conventional techniques. Desired peptide 

25 fragments may be chemically synthesized. An alternative approach involves generating EG-VEGF fragments by 

enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined by 
particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating the desired 
fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding a desired 
polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define the desired termini of the 

30 DNA fragment are employed at the 5' and 3' primers in the PCR. Preferably, EG-VEGF polypeptide fragments share at 

least one biological and/or immunological activity with the native EG-VEGF polypeptide shown in Figure 2 (SEQ ID 
N0:2). 

In particular embodiments, conservative substitutions of interest are shown in Table 1 under the heading of 
preferred substitutions. If such substitutions result in a change in biological activity, then more substantial changes, 
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denominated exemplary substitutions in Table 1, or as further described below in reference to amino acid classes, are 
introduced and the products screened. 

Table 1 

5 





R^'tjue 




Preferred 




_es_ue 


Substitutions 






Ala (A) 


vsl; Igu; lis 


val 


10 


Arg (R) 


lys; gin; asn 


lys 




Asn (l\l) 


gin; his; lys; srg 






Asp (D) 




rjl" 




Cys (C) 


ser 


ser 




Gin (Q) 






15 


Glu (E) 


asp 


asp 




Gly (G) 


pro; ala 


ala 




His (H) 


asn; gin; lys; arg 


arg 




He (IJ 










norleucine 


leu 


20 


Leu (L) 


norleucine; He; val; 








met; ala; phe 


ile 




Lys (K) 


arg; gin; asn 


arg 




Met (M) 


leu; phe; ile 


leu 




Phe (F) 


leu; val; ile; ala; tyr 


leu 


25 


Pro (P) 


ala 


ala 




Ser (S) 


thr 


thr 




Thr(T) 


ser 


ser 




Trp(W) 


tyr; phe 


tyr 




Tyr(Y) 


trp; phe; thr; ser 


phe 


30 


Val (V) 


ile; leu; met; phe; 








ala; norleucine 


leu 



Substantial modifications in function or immunological identity of the EG-VEGF polypeptide are accomplished 
by selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the polypeptide 
35 backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or 

hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally occurring residues are 
divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 
40 (3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 
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Non-conservative substitutions will entail exchanging a member of one of these classes for another class. 
Such substituted residues also may be introduced into the conservative substitution sites or, more preferably, into the 
remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oligonucleotide-mediated (site-directed) 
5 mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al., IMucl. Acids Res. , 

13:4331 (1986); Zoller et al., IMucl. Acids Res. , 10:6487 (1987)], cassette mutagenesis [Wells et al.. Gene, 34:315 
(1985)], restriction selection mutagenesis [Wells et al., Philos. Trans. R. Soc. London SerA , 317:415 (1986)] or other 
known techniques can be performed on the cloned DNA to produce the EG-VEGF variant DMA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a contiguous 
10 sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such amino acids 

include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid among this group 
because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main-chain conformation of 
the variant [Cunningham and Wells, Science , 244 : 1081-1085 (1989)]. Alanine is also typically preferred because it is 
the most common amino acid. Further, it is frequently found in both buried and exposed positions [Creighton, The 
15 Proteins , (W.H. Freeman & Co., N.Y.); Chothia, J. Mol. Biol. , 150:1 (1976)]. If alanine substitution does not yield 

adequate amounts of variant, an isoteric amino acid can be used. 



C. Modifications of EG-VEGF 
Covalent modifications of EG-VEGF are included within the scope of this invention. One type of covalent 
20 modification includes reacting targeted amino acid residues of a EG-VEGF polypeptide with an organic derivatizing 
agent that is capable of reacting with selected side chains or the N- or C- terminal residues of the EG-VEGF. 
Derivatization with Afunctional agents is useful, for instance, for crosslinking EG-VEGF to a water-insoluble support 
matrix or surface for use in the method for purifying anti-EG-VEGF antibodies, and vice-versa. Commonly used 
crosslinking agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, l\l-hydroxysuccinimide esters, for 
25 example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'- 
dithiobis(succinimidylpropionate), Afunctional maleimides such as bis-l\l-maleimido-1,8-octane and agents such as 
methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 
glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of 
30 seryl or threonyl residues, methylation of the -amino groups of lysine, arginine, and histidine side chains [T.E. 

Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)], 
acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the EG-VEGF polypeptide included within the scope of this invention 
comprises altering the native glycosylation pattern of the polypeptide. "Altering the native glycosylation pattern" is 
35 intended for purposes herein to mean deleting one or more carbohydrate moieties found in native sequence EG-VEGF 
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(either by removing the underlying glycosylate site or by deleting the glycosylation by chemical and/or enzymatic 
means), and/or adding one or more glycosylation sites that are not present in the native sequence EG-VEGF. In 
addition, the phrase includes qualitative changes in the glycosylation of the native proteins, involving a change in the 
nature and proportions of the various carbohydrate moieties present. 
5 Addition of glycosylation sites to the EG-VEGF polypeptide may be accomplished by altering the amino acid 

sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more serine or 
threonine residues to the native sequence EG-VEGF (for O-linked glycosylation sites). The EG-VEGF amino acid 
sequence may optionally be altered through changes at the DNA level, particularly by mutating the DMA encoding the 
EG-VEGF polypeptide at preselected bases such that codons are generated that will translate into the desired amino 
10 acids. 

Another means of increasing the number of carbohydrate moieties on the EG-VEGF polypeptide is by 
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, e.g., in WO 
87/05330 published 1 1 September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem. , pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the EG-VEGF polypeptide may be accomplished chemically or 

15 enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets for 

glycosylation. Chemical deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et al., Arch. Biochem. Biophys. , 259:52 (1987) and by Edge et al., Anal. Biochem. , 118:131 (1981). Enzymatic 
cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo- and exo- 
glycosidases as described by Thotakura et al., Meth. Enzymol. , 138:350 (1 987). 

20 Another type of covalent modification of EG-VEGF comprises linking the EG-VEGF polypeptide to one of a 

variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in the 
manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

The EG-VEGF of the present invention may also be modified in a way to form a chimeric molecule comprising 
EG-VEGF fused to' another, heterologous polypeptide or amino acid sequence. 

25 In one embodiment, such a chimeric molecule comprises a fusion of the EG-VEGF with a tag polypeptide 

which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed at 
the amino- or carboxyl- terminus of the EG-VEGF. The presence of such epitope-tagged forms of the EG-VEGF can be 
detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the EG-VEGF to be 
readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix that binds to the 

30 epitope tag. Various tag polypeptides and their respective antibodies are well known in the art. Examples include 

poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 
[Field et al., Mol. Cell. Biol. , 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 
antibodies thereto [Evan et al., Molecular and Cellular Biology , 5:3610-3616 (1985)]; and the Herpes Simplex virus 
glycoprotein D (gD) tag and its antibody [Paborsky et al. Protein Engineering , 3(6):547-553 (1990)]. Other tag 

35 polypeptides include the Flag-peptide [Hopp et al., BioTechnolopy , 6:1204-1210 (1988)]; the KT3 epitope peptide 
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[Martin et al., Science , 255:192-194 (1992)]; an -tubulin epitope peptide [Skinner et al., J. Biol. Chem. , 266:15163- 
15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA , 87:6393- 
6397(1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the EG-VEGF with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule (also referred 
to as an "immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The Ig fusions preferably 
include the substitution of a soluble (transmembrane domain deleted or inactivated) form of a EG-VEGF polypeptide in 
place of at least one variable region within an !g molecule. In a particularly preferred embodiment, the immunoglobulin 
fusion includes the hinge, CH2 and CH3, or the hinge, CH1, CH2 and CH3 regions of an lgG1 molecule. For the 
production of immunoglobulin fusions see also US Patent No. 5,428,130 issued June 27, 1995. 

D. Preparation of EG-VEGF 

The description below relates primarily to production of EG-VEGF by culturing cells transformed or 
transfected with a vector containing EG-VEGF nucleic acid. It is, of course, contemplated that alternative methods, 
which are well known in the art, may he employed to prepare EG-VEGF. For instance, the EG-VEGF sequence, or 
portions thereof, may be produced by direct peptide synthesis using solid-phase techniques [see, e.g., Stewart et al., 
Solid-Phase Peptide Synthesis , W.H. Freeman Co., San Francisco, CA (1969); Merrifield, J. Am. Chem. Soc , 85:2149- 
2154 (1963)]. In vitro protein synthesis may be performed using manual techniques or by automation. Automated 
synthesis may be accomplished, for instance, using an Applied Biosystems Peptide Synthesizer (Foster City, CA) using 
manufacturer's instructions. Various portions of the EG-VEGF may be chemically synthesized separately and combined 
using chemical or enzymatic methods to produce the full-length EG-VEGF. 

1. Isolation of PIMA Encoding EG-VEGF 
DNA encoding EG-VEGF may be obtained from a cDNA library prepared from tissue believed to possess the 
EG-VEGF mRNA and to express it at a detectable level. Accordingly, human EG-VEGF DNA can be conveniently 
obtained from a cDNA library prepared from human tissue, such as described in the Examples. The EG-VEGF-encoding 
gene may also be obtained from a genomic library or by known synthetic procedures (e.g., automated nucleic acid 
synthesis). 

Libraries can be screened with probes (such as antibodies to the EG-VEGF or oligonucleotides of at least 
about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA or 
genomic library with the selected probe may be conducted using standard procedures, such as described in Sambrook 
et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989). An 
alternative means to isolate the gene encoding EG-VEGF is to use PCR methodology [Sambrook et al., supra; 
Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)]. 
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The Examples below describe techniques for screening a cONA library. The oligonucleotide sequences 
selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. The 
oligonucleotide is preferably labeled such that it can be detected upon hybridization to DI\IA in the library being 
screened. Methods of labeling are well known in the art, and include the use of radiolabels like 32 P-labeled ATP, 
5 biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high stringency, are 

provided in Sambrook et al., supra . 

Sequences identified in such library screening methods can be compared and aligned to other known 
sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across the 
10 full-length sequence can be determined using methods known in the art and as described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic 
libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 
conventional primer extension procedures as described in Sambrook et al., supra , to detect precursors and processing 
intermediates of mRNA that may not have been reverse-transcribed into cDNA. 

15 

2. Selection and Transformation of Host Cells 
Host cells are transfected or transformed with expression or cloning vectors described herein for EG-VEGF 
production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting 
transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such as media, 

20 temperature, pH and the like, can be selected by the skilled artisan without undue experimentation. In general, 

principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in 
Mammalian Cell Biotechnology: a Practical Approach , M. Butler, ed. (IRL Press, 1991) and Sambrook et al., supra . 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily 
skilled artisan, for example, CaCl z , CaP0 4 , liposome-mediated and electroporation. Depending on the host cell used, 

25 transformation is performed using standard techniques appropriate to such cells. The calcium treatment employing 

calcium chloride, as described in Sambrook et al., supra , or electroporation is generally used for prokaryotes. Infection 
with Agrobacterium tumefaciens is used for transformation of certain plant cells, as described by Shaw et al., Gene, 
23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells without such cell walls, the calcium 
phosphate precipitation method of Graham and van der Eb, Virology , 52:456-457 (1978) can be employed. General 

30 aspects of mammalian cell host system transfections have been described in U.S. Patent No. 4,399,216. 

Transformations into yeast are typically carried out according to the method of Van Solingen et al., J. Bact. , 130:946 
(1977) and Hsiao et al., Proc. Natl. Acad. Sci. (USA) , 76:3829 (1979). However, other methods for introducing DNA 
into cells, such as by nuclear microinjection, electroporation, bacterial protoplast fusion with intact cells, or 
polycations, e.g., polybrene, polyornithine, may also be used. For various techniques for transforming mammalian 
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cells, ses Keown et al.. Methods in Enzvmology, 185:527-537 (1990) and Mansour et al., Nature , 336:348-352 
(1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, or 
higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative or Gram- 
5 positive organisms, for example, Enterobacteriaceae such as £ coli. Various £ coli strains are publicly available, such 

as£ C0//K12 strain MM294 (ATCC 31,446);£ co//X1776 (ATCC 31,537);£ coli strain W3110 (ATCC 27,325) and 
K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include Enterobacteriaceae such as Escherichia, e.g., £ 
coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, e.g.. Salmonella typhimurium, Serratia, e.g., Serratia 
marcescans, and Shigella, as well as Bacilli such as B. subtiiis and B. iicheniformis (e.g., B. licheniformis 41 P disclosed 

10 in DD 266,710 published 12 April 1989), Pseudomonas such as P. aeruginosa, and Streptomyces. These examples 

are illustrative rather than limiting. Strain W3110 is one particularly preferred host or parent host because it is a 
common host strain for recombinant DNA product fermentations. Preferably, the host cell secretes minimal amounts 
of proteolytic enzymes. For example, strain W3110 may be modified to effect a genetic mutation in the genes 
encoding proteins endogenous to the host, with examples of such hosts including £ coli W31 10 strain 1 A2, which has 

15 the complete genotype tonA ; £ coli W31 10 strain 9E4, which has the complete genotype tonA ptr3; £ coli W31 10 

strain 27C7 (ATCC 55,244), which has the complete genotype tonA ptr3 phoA El 5 fargF-/acJW9 degP ompTkarf; £ 
CO//W31 10 strain 37D6, which has the complete genotype tonA ptr3phoA E15 (argF-lac)169 degP ompT rbs7ilvG 
kaif; £ coli W31 10 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion mutation; and an 
£ coli strain having mutant periplastic protease disclosed in U.S. Patent No. 4,946,783 issued 7 August 1990. 

20 Alternatively, in vitro methods of cloning, e.g., PCR or other nucleic acid polymerase reactions, are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning or 
expression hosts for EG-VEGF-encoding vectors. Saccharomyces cerevisiae is a commonly used lower eukaryotic host 
microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse, Nature , 290: 140 [1981]; EP 139,383 
published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et al., BioJTechnoloav , 9:968-975 

25 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt et al., J. BacterioL 154(21:737-742 

[1983]), K. fragilis (ATCC 12,424), K. bulgaricus (ATCC 16,045], K. wickeramii (ATCC 24,178), K. waltii (ATCC 
56,500), K. drosophilarum (ATCC 36,906; Van den Berg et al., Bio/Technology , 8:135 (1990)), K. thermotolerans, and 
K. marxianus; yarrowia (EP 402,226); Pichia pastoris (EP 183,070; Sreekrishna et al., J. Basic Microbiol. , 28:265- 
278 [1988]); Candida; Trichoderma reesia (EP 244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA , 

30 76:5259-5263 [1979]); Schwanniomyces such as Schwanniomyces occ/denta/is (EP 394,538 published 31 October 

1990); and filamentous fungi such as, e.g., Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 10 
January 1991), and Aspergillus hosts such as A. nidulans (Ballance et al., Biochem. Biophys. Res. Commun., 1 12:284- 
289 [1983]; Tilburn et al., Gene, 26:205-221 [1983]; Yelton et al., Proc. Natl. Acad. Sci. USA , 81: 1470-1474 
[1984]) and A. niger (Kelly and Hynes, EMBO J. , 4:475-479 [1985]). Methylotropic yeasts are suitable herein and 

35 include, but are not limited to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, 
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Candida, Kloeckera, Pichia, Saccharomyces, Torulopsis, and Rhodotorula. A list of specific species that are exemplary 
of this class of yeasts may be found in C. Anthony, The Biochemistry of Methvlotrophs , 269 (1982). 

Suitable host cells for the expression of glycosylated EG-VEGF are derived from multicellular organisms. 
Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plant cells. 
5 Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and CDS cells. More specific 

examples include monkey kidney CV1 line transformed by SV40 (COS- 7, ATCC CRL 1651); human embryonic kidney 
line (293 or 293 cells subcloned for growth in suspension culture, Graham et al., J. Gen Virol., 36:59 (1977)); Chinese 
hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad. Sci. USA . 77:4216 (1980)); mouse Sertoli cells 
(TM4, Mather. Biol. Reprod. . 23:243-251 (1980)); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, 
10 HB 8065); and mouse mammary tumor (MMT 060562, ATCC CCL51). The selection of the appropriate host cell is 

deemed to be within the skill in the art. 



3. Selection and Use of a Replicable Vector 

15 The nucleic acid (e.g., cDNA or genomic DMA) encoding EG-VEGF may be inserted into a replicable vector for 

cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The vector may, for 
example, be in the form of a plasmid, cosmid, viral particle, or phage. The appropriate nucleic acid sequence may be 
inserted into the vector by a variety of procedures. In general, DNA is inserted into an appropriate restriction 
endonuclease site(s) using techniques known in the art. Vector components generally include, but are not limited to, 

20 one or more of a signal sequence, an origin of replication, one or more marker genes, an enhancer element, a promoter, 

and a transcription termination sequence. Construction of suitable vectors containing one or more of these 
components employs standard ligation techniques which are known to the skilled artisan. 

The EG-VEGF may be produced recombinantly not only directly, but also as a fusion polypeptide with a 
heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site at the 

25 N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component of the vector, or 

it may be a part of the EG-VEGF-encoding DNA that is inserted into the vector. The signal sequence may be a 
prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, penicillinase, Ipp, or 
heat-stable enterotoxin II leaders. For yeast secretion the signal sequence may be, e.g., the yeast invertase leader, 
alpha factor leader (including Saccharomyces and Kkyveromyces -factor leaders, the latter described in U.S. Patent 

30 No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase leader (EP 362,179 published 4 April 1990), 

or the signal described in WO 90/13646 published 15 November 1990. In mammalian cell expression, mammalian 
signal sequences may be used to direct secretion of the protein, such as signal sequences from secreted polypeptides 
of the same or related species, as well as viral secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate in 

35 one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. The 
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origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2 plasmid origin is 
suitable for yeast, and various viral origins (SV4Q, polyoma, adenovirus, VSV or BPV) are useful for cloning vectors in 
mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 
5 Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, 
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients not 
available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 

An example of suitable selectable markers for mammalian cells are those that enable the identification of 
cells competent to take up the EG-VEGF-encoding nucleic acid, such as DHFR or thymidine kinase. An appropriate host 
10 cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, prepared and propagated as 
described by Urlaub et a!., Proc. Natl. Acad. Sci. USA , 77:4216 (1980). A suitable selection gene for use in yeast is 
the trp\ gene present in the yeast plasmid YRp7 [Stinchcomb et al., Nature , 282:39 (1979); Kingsman et al., Gene , 
7:141 (1979); Tschemper et al.. Gene, 10:157(1980)]. Jhetrp] gene provides a selection marker for a mutant strain 
of yeast lacking the ability to grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 [Jones, Genetics , 85:12 
15 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the EG-VEGF-encoding nucleic 

acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well known. 

Promoters suitable for use with prokaryotic hosts include the -lactamase and lactose promoter systems [Chang et al., 

Nature , 275:615 (1978); Goeddel et al.. Nature , 281:544 (1979)], alkaline phosphatase, a tryptophan (trp) promoter 
20 system [Goeddel, Nucleic Acids Res. , 8:4057 (1980); EP 36,776], and hybrid promoters such as the tac promoter 

[deBoer et al., Proc. Natl. Acad. Sci. USA, 80:21-25 (1983)]. Promoters for use in bacterial systems also will contain 

a Shine-Dalgarno (S.D.) sequence operably linked to the DNA encoding EG-VEGF. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 

phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem. , 255:2073 (1980)] or other glycolytic enzymes [Hess et al., 
25 J. Adv. Enzyme Reg. , 7:149 (1968); Holland, Biochemistry , 17:4900 (1978)], such as enolase, glyceraldehyde-3- 

phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 

3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. 
Other yeast promoters, which are inducible promoters having the additional advantage of transcription 

controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
30 phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3-phosphate 

dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and promoters for use 

in yeast expression are further described in EP 73,657. 

EG-VEGF transcription from vectors in mammalian host cells is controlled, for example, by promoters 

obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 July 1989), 
35 adenovirus (such as Adenovirus 2], bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, 
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hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g., the actin promoter or an 
immunoglobulin promoter, and from heat-shock promoters, provided such promoters are compatible with the host cell 
systems. 

Transcription of a DMA encoding the EG-VEGF by higher eukaryotes may be increased by inserting an 
5 enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp, that 

act on a promoter to increase its transcription. Many enhancer sequences are now known from mammalian genes 
(globin, elastase, albumin, -fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic 
cell virus. Examples include the SV40 enhancer on the late side of the replication origin (bp 100-270), the 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and 
10 adenovirus enhancers. The enhancer may be spliced into the vector at a position 5" or 3' to the EG-VEGF coding 

sequence, but is preferably located at a site 5' from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated cells 
from other multicellular organisms) will also contain sequences necessary for the termination of transcription and for 
stabilizing the mRNA. Such sequences are commonly available from the 5' and, occasionally 3', untranslated regions 
15 of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as polyadenylated 

fragments in the untranslated portion of the mRNA encoding EG-VEGF. 

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of EG-VEGF in 
recombinant vertebrate cell culture are described in Gething et al., Nature , 293:620-625 (1981); Mantei et al., Nature , 
281:40-46 (1979); EP 1 17,060; and EP 1 17,058. 

20 

4. Detecting Gene Amplification/Expression 
Gene amplification and/or expression may be measured in a sample directly, for example, by conventional 
Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl. Acad. Sci. USA , 
77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled probe, 
25 based on the sequences provided herein. Alternatively, antibodies may be employed that can recognize specific 

duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. The 
antibodies in turn may be labeled and the assay may be carried out where the duplex is bound to a surface, so that 
upon the formation of duplex on the surface, the presence of antibody bound to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as immunohistochemical 
30 staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate directly the expression of gene 

product. Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either monoclonal 
or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be prepared against a native 
sequence EG-VEGF polypeptide or against a synthetic peptide based on the DNA sequences provided herein or against 
exogenous sequence fused to EG-VEGF DNA and encoding a specific antibody epitope. 

35 
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5. Purification of Polypeptide 
Forms of EG-VEGF may be recovered from culture medium or from host cell lysates. If membrane-bound, it 
can be released from the membrane using a suitable detergent solution (e.g. Triton-X 1 00) or by enzymatic cleavage. 
Cells employed in expression of EG-VEGF can be disrupted by various physical or chemical means, such as freeze-thaw 
5 cycling, sonication, mechanical disruption, or cell lysing agents. 

It may be desired to purify EG-VEGF from recombinant cell proteins or polypeptides. The following 
procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; ethanol 
precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as DEAE; 
chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, Sephadex G-75; 
10 protein A Sepharose columns to remove contaminants such as IgG; and metal chelating columns to bind epitope-tagged 
forms of the EG-VEGF. Various methods of protein purification may be employed and such methods are known in the 
art and described for example in Deutscher, Methods in Enzymologv , 182 (1990); Scopes, Protein Purification: 
Principles and Practice , Springer-Verlag, Mew York (1982). The purification step(s) selected will depend, for example, 
on the nature of the production process used and the particular EG-VEGF produced. 

15 

E. Uses for EG-VEGF 

Nucleotide sequences (or their complement) encoding EG-VEGF have various applications in the art Df 
molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in the generation of 
anti-sense RNA and DNA. EG-VEGF nucleic acid will also be useful for the preparation of EG-VEGF polypeptides by the 

20 recombinant techniques described herein. 

The full-length native sequence EG-VEGF gene fSEQ ID N0:1), or portions thereof, may be used as 
hybridization probes for a cDNA library to isolate the full-length EG-VEGF cDMA or to isolate still other cDNAs (for 
instance, those encoding naturally-occurring variants of EG-VEGF or EG-VEGF from other species) which have a desired 
sequence identity to the EG-VEGF sequence disclosed in Figure 1 (SEO ID I\I0:1). Optionally, the length of the probes 

25 will be about 20 to about 50 bases. The hybridization probes may be derived from at least partially novel regions of 

the nucleotide sequence of SEQ ID l\IO:1 wherein those regions may be determined without undue experimentation or 
from genomic sequences including promoters, enhancer elements and introns of native sequence EG-VEGF. By way of 
example, a screening method will comprise isolating the coding region of the EG-VEGF gene using the known DNA 
sequence to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety of 

30 labels, including radionucleotides such as 32 P or 35 S, or enzymatic labels such as alkaline phosphatase coupled to the 

probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of the EG-VEGF 
gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to determine 
which members of such libraries the probe hybridizes to. Hybridization techniques are described in further detail in the 
Examples below. 
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Any EST sequences disclosed in the present application may similarly be employed as probes, using the 
methods disclosed herein. 

Other useful fragments of the EG-VEGF nucleic acids include antisense or sense oligonucleotides comprising 
a singe-stranded nucleic acid sequence (either RNA or DMA) capable of binding to target EG-VEGF mRNA (sense) or EG- 
5 VEGF DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the present invention, comprise a 

fragment of the coding region of EG-VEGF DNA. Such a fragment generally comprises at least about 14 nucleotides, 
preferably from about 14 to 30 nucleotides. The ability to derive an antisense or a sense oligonucleotide, based upon 
a cDNA sequence encoding a given protein is described in, for example, Stein and Cohen ( Cancer Res. 48:2659, 1988) 
and van der Krol et al. (BioTechniques 6:958, 1988). 

10 Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation of 

duplexes that block transcription or translation of the target sequence by one of several means, including enhanced 
degradation of the duplexes, premature termination of transcription or translation, or by other means. The antisense 
oligonucleotides thus may be used to block expression of EG-VEGF proteins. Antisense or sense oligonucleotides 
further comprise oligonucleotides having modified sugar-phosphodiester backbones (or other sugar linkages, such as 

15 those described in WO 91/06629) and wherein such sugar linkages are resistant to endogenous nucleases. Such 

oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of resisting enzymatic degradation) but 
retain sequence specificity to be able to bind to target nucleotide sequences. 

Other examples of sense or antisense oligonucleotides include those oligonucleotides which are covalently 
linked to organic moieties, such as those described in WO 90/10048, and other moieties that increases affinity of the 

20 oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, intercalating agents, such as 

ellipticine, and alkylating agents or metal complexes may be attached to sense or antisense oligonucleotides to modify 
binding specificities of the antisense or sense oligonucleotide for the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid sequence 
by any gene transfer method, including, for example, CaP0 4 -mediated DNA transfection, electroporation, or by using 

25 gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an antisense or sense oligonucleotide is 

inserted into a suitable retroviral vector. A cell containing the target nucleic acid sequence is contacted with the 
recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral vectors include, but are not limited to, those 
derived from the murine retrovirus M-MuLV, N2 (a retrovirus derived from M-MuLV), or the double copy vectors 
designated DCT5A, DCT5B and DCT5C (see WO 90/13641). 

30 Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide 

sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable ligand 
binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines, or other 
ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does not 
substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule or receptor, 

35 or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell. 
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Alternatively, a sense or an antisense oligonucleotide may he introduced into a cell containing the target 
nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The sense or 
antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase. 

The probes may also be employed in PCR techniques to generate a pool of sequences for identification of 
5 closely related EG-VEGF coding sequences. 

Nucleotide sequences encoding a EG-VEGF can also be used to construct hybridization probes for mapping 
the gene which encodes that EG-VEGF and for the genetic analysis of individuals with genetic disorders. The 
nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome using 
known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, and 

10 hybridization screening with libraries. 

This invention also provides for methods of using the EG-VEGF in assays to identify other proteins or 
molecules that can bind to the EG-VEGF protein. By such methods, inhibitors of the receptor/ligand binding interaction 
can be identified. Proteins involved in such binding interactions can also be used to screen for peptide or small 
molecule inhibitors or agonists of the binding interaction. Also, the receptor EG-VEGF can be used to isolate 

15 correlative ligand(s). Screening assays can be designed to find lead compounds that mimic the biological activity of a 

native EG-VEGF or a receptor for EG-VEGF. Such screening assays will include assays amenable to high-throughput 
screening of chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small 
molecules contemplated include synthetic organic or inorganic compounds. The assays can be performed in a variety 
of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and cell based 

20 assays, which are well characterized in the art. 

Nucleic acids which encode EG-VEGF or its modified forms can also be used to generate either transgenic 
animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically useful 
reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene, which 
transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic stage. A 

25 transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal develops. In one 

embodiment, cDNA encoding EG-VEGF can be used to clone genomic DNA encoding EG-VEGF in accordance with 
established techniques and the genomic sequences used to generate transgenic animals that contain cells which 
express DNA encoding EG-VEGF. Methods for generating transgenic animals, particularly animals such as mice or 
rats, have become conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 

30 4,870,009. Typically, particular cells would be targeted for EG-VEGF transgene incorporation with tissue-specific 

enhancers. Transgenic animals that include a copy of a transgene encoding EG-VEGF introduced into the germ line of 
the animal at an embryonic stage can be used to examine the effect of increased expression of DNA encoding EG- 
VEGF. Such animals can be used as. tester animals for reagents thought to confer protection from, for example, 
pathological conditions associated with its overexpression. In accordance with this facet of the invention, an animal 
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is treated with the reagent and a reduced incidence of the pathological condition, compared to untreated animals 
bearing the transgene, would indicate a potential therapeutic intervention for the pathological condition. 

Alternatively, non-human homologues of EG-VEGF can be used to construct a EG-VEGF "knock out" animal 
which has a defective or altered gene encoding EG-VEGF as a result of homologous recombination between the 
5 endogenous gene encoding EG-VEGF and altered genomic DNA encoding EG-VEGF introduced into an embryonic stem 

cell of the animal. For example, cDNA encoding EG-VEGF can be used to clone genomic DNA encoding EG-VEGF in 
accordance with established techniques. A portion of the genomic DMA encoding EG-VEGF can be deleted or replaced 
with another gene, such as a gene encoding a selectable marker which can be used to monitor integration. Typically, 
several kilobases of unaltered flanking DNA (both at the 5" and 3' ends) are included in the vector [see e.g., Thomas 

10 and Capecchi, Cej|, 51:503 (1987) for a description of homologous recombination vectors]. The vector is introduced 

into an embryonic stem cell line (e.g., by electroporation) and cells in which the introduced DNA has homologously 
recombined with the endogenous DNA are selected [see e.g., Li et al.. Cell, 69:915 (1992)]. The selected cells are 
then injected into a blastocyst of an animal (e.g., a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in 
Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113- 

15 152]. A chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal and the embryo 

brought to term to create a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ 
cells can be identified by standard techniques and used to breed animals in which all cells of the animal contain the 
homologously recombined DNA. Knockout animals can be characterized for instance, for their ability to defend against 
certain pathological conditions and for their development of pathological conditions due to absence of the EG-VEGF 

20 polypeptide. 

Nucleic acid encoding the EG-VEGF polypeptides may also be used in gene therapy. In gene therapy 
applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective genetic 
product, for example for replacement of a defective gene. "Gene therapy" includes both conventional gene therapy 
where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic agents, which 

25 involves the one time or repeated administration of a therapeutically effective DNA or mRNA. Antisense RNAs and 

DNAs can be used as therapeutic agents for blocking the expression of certain genes in vivo. It has already been 
shown that short antisense oligonucleotides can be imported into cells where they act as inhibitors, despite their low 
intracellular concentrations caused by their restricted uptake by the cell membrane. (Zamecnik etai, Proc. Natl. Acad. 
Sci. USA 83:4143-4146 [1986]). The oligonucleotides can be modified to enhance their uptake, e.g. by substituting 

3 0 their negatively charged phosphodiester groups by uncharged groups. 

There are a variety of techniques available for introducing nucleic acids into viable cells. The techniques 
vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in the cells of the 
intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in vitro include the use of 
liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium phosphate precipitation method, etc. 

3 5 The currently preferred in vivo gene transfer techniques include transfection with viral (typically retroviral) vectors and 
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viral coat protein-liposome mediated transfection (Dzau et al., Trends in Biotechnology 11, 205-210 [1993]). In some 
situations it is desirable to provide the nucleic acid source with an agent that targets the target cells, such as an 
antibody specific for a cell surface membrane protein or the target cell, a ligand for a receptor on the target cell, etc. 
Where liposomes are employed, proteins which hind to a cell surface membrane protein associated with endocytosis 
5 may be used for targeting and/or to facilitate uptake, e.g. capsid proteins or fragments thereof tropic for a particular 

cell type, antibodies for proteins which undergo internalization in cycling, proteins that target intracellular localization 
and enhance intracellular half-life. The technique of receptor-mediated endocytosis is described, for example, by Wu 
et al., J. Biol. Chem. 262, 4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 3410-3414 (1990). 
For review of gene marking and gene therapy protocols see Anderson et al., Science 256, 808-813 (1992). 
10 The EG-VEGF polypeptides described herein may also be employed as molecular weight markers for protein 

electrophoresis purposes. 

The nucleic acid molecules encoding the EG-VEGF polypeptides or fragments thereof described herein are 
useful far chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 
markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently available. 

1 5 Each EG-VEGF nucleic acid molecule of the present invention can be used as a chromosome marker. 

The EG-VEGF polypeptides and nucleic acid molecules of the present invention may also be used for tissue 
typing, wherein the EG-VEGF polypeptides of the present invention may be differentially expressed in one tissue as 
compared to another. EG-VEGF nucleic acid molecules will find use for generating probes for PCR, Northern analysis, 
Southern analysis and Western analysis. 

20 The EG-VEGF polypeptides and modulators thereof described herein may also be employed as therapeutic 

agents. The EG-VEGF polypeptides and EG-VEGF modulators of the present invention can be formulated according to 
known methods to prepare pharmaceutically useful compositions, whereby the EG-VEGF product hereof is combined in 
admixture with a pharmaceutically acceptable carrier vehicle. Therapeutic formulations are prepared for storage by 
mixing the active ingredient having the desired degree of purity with optional physiologically acceptable carriers, 

25 excipients or stabilizers ( Remington's Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980)), in the form of 

lyophilized formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients 
at the dosages and concentrations employed, and include buffers such as phosphate, citrate and other organic acids; 
antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, such 
as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino acids such as 

30 glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other carbohydrates including 
glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt- 
forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, PLURONICS™ or PEG. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes, prior to or following lyophilization and reconstitution. 
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Therapeutic compositions herein generally are placed into a container having a sterile access port, for 
example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle. 

The route of administration is in accord with known methods, e.g. injection or infusion by intravenous, 
intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or intralesional routes, topical administration, or 
5 by sustained release systems. 

Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may vary 
depending on the particular use envisioned. The determination of the appropriate dosage or route of administration is 
well within the skill of an ordinary physician. Animal experiments provide reliable guidance for the determination of 
effective doses for human therapy. Interspecies scaling of effective doses can be performed following the principles 
10 laid down by Mordenti, J. and Chappell, W. "The use of interspecies scaling in toxicokinetics" In Toxicokinetics and 

New Drug Development, Yacobi et a!., Eds., Pergamon Press, New York 1989, pp. 42-96. 

When in vivo administration of a EG-VEGF polypeptide or agonist or antagonist thereof is employed, normal 
dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more per day, 
preferably about 1 g/kg/day to 10 mg/kgfday, depending upon the route of administration. Guidance as to particular 
15 dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos. 4,657,760; 5,206,344; 

or 5,225,212. It is anticipated that different formulations will be effective for different treatment compounds and 
different disorders, that administration targeting one organ or tissue, for example, may necessitate delivery in a 
manner different from that to another organ or tissue. 

Where sustained-release administration of a EG-VEGF polypeptide or modulator is desired in a formulation 
20 with release characteristics suitable for the treatment of any disease or disorder requiring administration of the EG- 
VEGF polypeptide, microencapsulation of the EG-VEGF polypeptide is contemplated. Microencapsulation of 
recombinant proteins for sustained release has been successfully performed with human growth hormone (rhGH), 
interferon- (rhIFN- ), interleukin-2, and MN rgp120. Johnson et al., Mat. Med. , 2:795-799 (1996); Yasuda, Biomed. 
Then, 27:1221-1223 (1993); Hora et al., Bio/Technology, 8:755-758 (1990); Cleland, "Design and Production of 
25 Single Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems," in Vaccine Design: The Subunit 

and Adjuvant Approach, Powell and Newman, eds, (Plenum Press: New York, 1995), pp. 439-462; WO 97/03692, WO 
96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010. 

The sustained-release formulations of these proteins were developed using poly-lactic-coglycolic acid (PLGA) 
polymer due to its biocompatibility and wide range of biodegradable properties. The degradation products of PLGA, 
30 lactic and glycolic acids, can be cleared quickly within the human body. Moreover, the degradability of this polymer 

can be adjusted from months to years depending on its molecular weight and composition. Lewis, "Controlled release 
of bioactive agents from lactide/glycolide polymer," in: M. Chasin and R. Langer (Eds.), Biodegradable Polymers as 
Drug Delivery Systems (Marcel Dekker: New York, 1990), pp. 1-41. 

The therapeutic agents provided herein can be used in a number of treatments. The treatments include 
35 treating an individual with a condition associated with hormone producing tissue or endocrine glands. In one aspect, 
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EG-VEGF or a EG-VEGF agonist is administered to an individual in need thereof in an amount effective to treat the 
condition. EG-VEGF can be administered in a polypeptide or nucleic acid form. Preferably, the condition is one which 
requires an increase in the number of cells producing a particular hormone. Examples of such conditions include 
diabetes. Other conditions include those wherein it is desired to increase the number of cells in the reproductive 
5 organs such as those of the ovary, testis, uterus, prostate and placenta. 

Preferably, a EG-VEGF antagonist is administered to an individual with a condition associated with hormone 
producing tissue or endocrine glands. Wherein an EG-VEGF antagonist is administered, preferably the condition is one 
which requires a decrease in the numher of cells producing a particular hormone or a decrease in cell proliferation. For 
example, a method of regulating fertility in an individual is provided herein which comprises administering a EG-VEGF 

10 antagonist to the individual in an amount effective to regulate fertility. In one embodiment, fertility is regulated by 
inhibiting follicle maturation andjor ovulation. Alternatively, wherein the individual has or is at risk of having 
polycystic ovary syndrome a EG-VEGF antagonist is administered to the individual to maintain fertility by preventing 
the infertility which generally results from not treating the syndrome. An individual is at risk of a condition if there if 
the condition is hereditary and frequent in the family or has early symptoms of the condition. EG-VEGF antagonists 

15 can also be administered to treat cysts and other conditions associated with overproliferation, inflammation and 

excessive angiogenesis in hormone producing tissues. 

Steroid hormone-dependent disorders that may be addressed using compositions and methods of the present 
invention further include lipoid congenital adrenal hyperplasia, infertility, sexual maturation, androgen-dependent 
tumors, precocious puberty, McCune-Albright syndrome, adrenal-hypoplasia congenita, or hypogonadotropic 

20 hypogonadism. 

A specific condition which can be treated by the agents and compositions provided herein is cancer, in 
particular steroid-, e.g. androgen-dependent cancer. A preferred method of treating cancer as provided herein 
comprises administering a EG-VEGF antagonist to an individual with or at risk of having cancer in an amount effective 
to treat cancer. In one embodiment, the cancer is of a tissue selected from the group consisting of ovary, testis, 

25 prostate and uterus. 

It is understood that the methods of cell proliferation, inhibition of cell proliferation, chemotaxis, and 
methods of inhibiting chemotaxis can be performed in vivo or in vitro. In some cases, it may be desirable to add EG- 
VEGF to a cell sample in vitro so as to stimulate proliferation of a specific cell type. The EG-VEGF treated sample can 
then be used in screening assays or be transplanted into an individual in need of treatment or into an animal model. 

30 This invention also encompasses methods of screening compounds to identify those that mimic or enhance 

the EG-VEGF polypeptide (agonists) or prevent the effect of the EG-VEGF polypeptide (antagonists). EG-VEGF agonists 
and antagonists are also referred to as EG-VEGF modulators herein. Screening assays for antagonist drug candidates 
are designed to identify compounds that bind or complex with the EG-VEGF polypeptides encoded by the genes 
identified herein, or otherwise interfere with the interaction of the encoded polypeptides with other cellular proteins. 

35 The screening assays provided herein include assays amenable to high-throughput screening of chemical libraries, 
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making them particularly suitable for identifying small molecule drug candidates. Generally, binding assays and 
activity assays are provided. 

The assays can be performed in a variety of formats, including protein-protein binding assays, biochemical 
screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 
5 All assays for antagonists are common in that they call for contacting the drug candidate with an EG-VEGF 

polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these two 
components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, either the EG-VEGF polypeptide encoded by the gene identified herein or 

10 the drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of the 
EG-VEGF polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific for the 
EG-VEGF polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed by adding 
the non-immobilized component, which may be labeled by a detectable label, to the immobilized component, e.g., the 

15 coated surface containing the anchored component. When the reaction is complete, the non-reacted components are 
removed, e.g., by washing, and complexes anchored on the solid surface are detected. When the originally non- 
immobilized component carries a detectable label, the detection of label immobilized on the surface indicates that 
complexing occurred. Where the originally non-immobilized component does not carry a label, complexing can he 
detected, for example, by using a labeled antibody specifically binding the immobilized complex. 

20 If the candidate compound interacts with but does not bind to a particular EG-VEGF polypeptide encoded by 

a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking, co- 
immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, protein-protein 
interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers (Fields and 

25 Song, Nature (London) . 340:245-246 (1989); Chien et al., Proc. Natl. Acad. Sci. USA , 88:9578-9582 (1991)) as 

disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA , 89: 5789-5793 (1991). Many transcriptional 
activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting as the DNA-binding 
domain, the other one functioning as the transcription-activation domain. The yeast expression system described in 
the foregoing publications (generally referred to as the "two-hybrid system") takes advantage of this property, and 

30 employs two hybrid proteins, one in which the target protein is fused to the DNA-binding domain of GAL4, and 

another, in which candidate activating proteins are fused to the activation domain. The expression of a GAL1-/acZ 
reporter gene under control of a GAL4-activated promoter depends on reconstitution of GAL4 activity via protein- 
protein interaction. Colonies containing interacting polypeptides are detected with a chromogenic substrate for - 
galactosidase. A complete kit (MATCHMAKER™) for identifying protein-protein interactions between two specific 

35 proteins using the two-hybrid technique is commercially available from Clontech. This system can also be extended to 
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map protein domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial 
for these interactions. 

Compounds that interfere with the interaction of a gene encoding an EG-VEGF polypeptide identified herein 
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared 
5 containing the product of the gene and the intra- or extracellular component under conditions and for a time allowing 

for the interaction and binding of the two products. To test the ability of a candidate compound to inhibit binding, the 
reaction is run in the absence and in the presence of the test compound. In addition, a placebo may be added to a 
third reaction mixture, to serve as positive control. The binding (complex formation) between the test compound and 
the intra- or extracellular component present in the mixture is monitored as described hereinabove. The formation of a 

10 complex in the control reaction(s) but not in the reaction mixture containing the test compound indicates that the test 

compound interferes with the interaction of the test compound and its reaction partner. 

To assay for agonists and antagonists, the EG-VEGF polypeptide may be added to a cell along with the 
compound to be screened. A particular actiivty known to be modulated by EG-VEGF is observed and the ability of the 
compound to enhance or inhibit this activity, in the presence of the EG-VEGF polypeptide, indicates that the compound 

15 is an agonist or antagonist to the EG-VEGF polypeptide, respectively. Alternatively, agonists and antagonists may be 

detected by combining the EG-VEGF polypeptide and candidate compounds with membrane-bound EG-VEGF polypeptide 
receptors or recombinant receptors under appropriate conditions for a competitive inhibition assay. The EG-VEGF 
polypeptide can be labeled, such as by radioactivity, such that the number of EG-VEGF polypeptide molecules bound to 
the receptor can be used to determine the effectiveness of the potential agonist or antagonist. 

20 The gene encoding the receptor for EG-VEGF can be identified by numerous methods known to those of skill 

in the art, for example, ligand panning and FACS sorting. Coligan et al., Current Protocols in Immun. , 1(2): Chapter 5 
(1991). Preferably, expression cloning is employed wherein polyadenylated RI\IA is prepared from a cell responsive to 
the EG-VEGF polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS 
cells or other cells that are not responsive to the EG-VEGF polypeptide. Transfected cells that are grown on glass 

25 slides are exposed to labeled EG-VEGF polypeptide. The EG-VEGF polypeptide can be labeled by a variety of means 

including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and 
incubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are 
prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a single 
clone that encodes the putative receptor. 

30 As an alternative approach for receptor identification, labeled EG-VEGF polypeptide can be photoaffinity- 

linked with cell membrane or extract preparations that express the receptor molecule. Cross-linked material is 
resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, resolved 
into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained from micro- 
sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA library to identify the 

35 gene encoding the putative receptor. 
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In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor would 
be incubated with labeled EG-VEGF polypeptide in the presence of the candidate compound. The ability of the 
compound to enhance or block this interaction could then be measured. 

More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 
5 immunoglobulin with EG-VEGF polypeptide, and, in particular, antibodies including, without limitation, poly- and 

monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric or 
humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. 
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the EG-VEGF 
polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the EG- 

10 VEGF polypeptide. 

Another potential EG-VEGF polypeptide antagonist is an antisense RNA or DNA construct prepared using 
antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of mRNA 
by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used to control gene 
expression through triple-helix formation or antisense DNA or RNA, both of which methods are based on binding of a 

15 polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide sequence, which encodes the 

mature EG-VEGF polypeptides herein, is used to design an antisense RNA oligonucleotide of from about 10 to 40 base 
pairs in length. A DNA oligonucleotide is designed to be complementary to a region of the gene involved in 
transcription (triple helix • see Lee et al., Nucl. Acids Res., 6:3073 (1979); Cooney et al., Science , 241: 456 (1988); 
Dervan et al.. Science , 251:1360 (1991)), thereby preventing transcription and the production of the EG-VEGF 

20 polypeptide. The antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA 

molecule into the EG-VEGF polypeptide (antisense • Okano, Neurochem. , 56:560 (1991); Oligodeoxynucleotides as 
Antisense Inhibitors of Gene Expression (CRC Press: Boca Raton, FL, 1988). The oligonucleotides described above can 
also be delivered to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the EG- 
VEGF polypeptide. When antisense DNA is used, oligodeoxyribonucleotides derived from the translation-initiation site, 

25 e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred. 

Potential antagonists include small molecules that bind to the active site, the receptor binding site, or 
growth factor or other relevant binding site of the EG-VEGF polypeptide, thereby blocking the normal biological activity 
of the EG-VEGF polypeptide. Examples of small molecules include, but are not limited to, small peptides or peptide-like 
molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic compounds. 

30 In an alternative embodiment, overexpression of an agonist can serve as an antagonist wherein activity is 

regulated by positive feedback. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. Ribozymes act 
by sequence-specific hybridization to the complementary target RNA, followed by endonucleolytic cleavage. Specific 
ribozyme cleavage sites within a potential RNA target can be identified by known techniques. For further details see, 
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e.g., Rossi, Current Biology , 4:469-471 (1994), and PCT publication No. WQ 97133551 (published September 18, 
1997). 

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded and 
composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it promotes 
5 triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of purines or 

pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 97/33551, supra. 

These small molecules can be identified by any one or more of the screening assays discussed hereinabove 
and/or by any other screening techniques well known to those skilled in the art. 

It is appreciated that all the assays provided herein can be used to screen a wide variety of candidate 
10 bioactive agents. The term "candidate bioactive agent", "candidate agent" or "drug candidate" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic molecule, polysaccharide, 
polynucleotide, purine analog, etc., to be tested for bioactive agents that are capable of directly or indirectly altering 
either the cellular activity phenotype or the expression of a EG-VEGF sequence, including both nucleic acid sequences 
and protein sequences. 

15 Candidate agents can encompass numerous chemical classes, though typically they are organic molecules, 

preferably small organic compounds having a molecular weight of more than 100 and less than about 2,500 daltons. 
Small molecules are further defined herein as having a molecular weight of between 50 kD and 2000 kD. In another 
embodiment, small molecules have a molecular weight of less than 1500, or less than 1200, or less than 1000, or less 
than 750, or less than 500 kD. In one embodiment, a small molecule as used herein has a molecular weight of about 

20 100 to 200 kD. Candidate agents comprise functional groups necessary for structural interaction with proteins, 

particularly hydrogen bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably 
at least two of the functional chemical groups. The candidate agents often comprise cyclical carbon or heterocyclic 
structures and/or aromatic or polyaromatic structures substituted with one or more of the above functional groups. 
Candidate agents are also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 

25 pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred are peptides. 

Candidate agents are obtained from a wide variety of sources including libraries of synthetic or natural 
compounds. For example, numerous means are available for random and directed synthesis of a wide variety of 
organic compounds and biomolecules, including expression of randomized oligonucleotides. Alternatively, libraries of 
natural compounds in the form of bacterial, fungal, plant and animal extracts are available or readily produced. 

30 Additionally, natural or synthetically produced libraries and compounds are readily modified through conventional 

chemical, physical and biochemical means. Known pharmacological agents may be subjected to directed or random 
chemical modifications, such as acylation, alkylation, esterification, amidification to produce structural analogs. 

In a preferred embodiment, the candidate bioactive agents are proteins. By "protein" herein is meant at 
least two covalently attached amino acids, which includes proteins, polypeptides, oligopeptides and peptides. The 

35 protein may be made up of naturally occurring amino acids and peptide bonds, or synthetic peptidomimetic structures. 
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Thus "amino acid", or "peptide residue", as used herein means both naturally occurring and synthetic amino acids. For 
example, homo-phenylalanine, citrulline and noreleucine are considered amino acids for the purposes of the invention. 
"Amino acid" also includes imino acid residues such as proline and hydroxyproline. The side chains may be in either 
the (R) or the (S) configuration. In the preferred embodiment, the amino acids are in the (S) or L-configuration. If non- 
5 naturally occurring side chains are used, non-amino acid substituents may be used, for example to prevent or retard in 
vivo degradations. 

In a preferred embodiment, the candidate bioactive agents are naturally occurring proteins or fragments of 
naturally occurring proteins. Thus, for example, cellular extracts containing proteins, or random or directed digests of 
proteinaceous cellular extracts, may be used. In this way libraries of procaryotic and eucaryotic proteins may be 

10 made for screening in the methods of the invention. Particularly preferred in this embodiment are libraries of bacterial, 

fungal, viral, and mammalian proteins, with the latter being preferred, and human proteins being especially preferred. 

In a preferred embodiment, the candidate bioactive agents are peptides of from about 5 to about 30 amino 
acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to about 1 5 being particularly 
preferred. The peptides may be digests of naturally occurring proteins as is outlined above, random peptides, or 

15 "biased" random peptides. By "randomized" or grammatical equivalents herein is meant that each nucleic acid and 

peptide consists of essentially random nucleotides and amino acids, respectively. Since generally these random 
peptides (or nucleic acids, discussed below) are chemically synthesized, they may incorporate any nucleotide or amino 
acid at any position. The synthetic process can be designed to generate randomized proteins or nucleic acids, to allow 
the formation of all or most of the possible combinations over the length of the sequence, thus forming a library of 

20 randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or constants at any 
position. In a preferred embodiment, the library is biased. That is, some positions within the sequence are either held 
constant, or are selected from a limited number of possibilities. For example, in a preferred embodiment, the 
nucleotides or amino acid residues are randomized within a defined class, for example, of hydrophobic amino acids, 

25 hydrophilic residues, sterically biased (either small or large) residues, towards the creation of nucleic acid binding 

domains, the creation of cysteines, for cross-linking, prolines for SH-3 domains, serines, threonines, tyrosines or 
histidines for phosphorylation sites, etc., or to purines, etc. 

In a preferred embodiment, the candidate bioactive agents are nucleic acids. By "nucleic acid" or 
"oligonucleotide" or grammatical equivalents herein means at least two nucleotides covalently linked together. A 

30 nucleic acid of the present invention will generally contain phosphodiester bonds, although in some cases, as outlined 

below, nucleic acid analogs are included that may have alternate backbones, comprising, for example, phosphoramide 
(Beaucage et al., Tetrahedron 49(10): 1925 (1993) and references therein; Letsinger, J. Org. Chem. 35:3800 (1970); 
Sprinzl et al., Eur. J. Biochem. 81:579 (1977); Letsinger et al., Nucl. Acids Res. 14:3487 (1986); Sawai et al, Chem. 
Lett. 805 (1984), Letsinger et al., J. Am. Chem. Soc. 110:4470 (1988); and Pauwels et al., Chemica Scripta 26:141 

35 91986)), phosphorothioate (Mag et al., Nucleic Acids Res. 19:1437 (1991); and U.S. Patent No. 5,644,048), 
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phosphorodithioate (Briu et al., J. Am. Chem. Soc. 111:2321 (1989), O-methylphophoroamidite linkages (see Eckstein, 
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press), and peptide nucleic acid backbones 
and linkages (see Egholm, J. Am. Chem. Soc. 114:1895 (1992); Meier et al., Chem. Int. Ed. Engl. 31:1008 (1992); 
Nielsen, Nature, 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all of which are incorporated by reference). 
5 Other analog nucleic acids include those with positive backbones (Denpcy et al., Proc. Natl. Acad. Sci. USA 92:6097 

(1995) ; non-ionic backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863; 
Kiedrowshi et al., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al., J. Am. Chem. Soc. 110:4470 
(1988); Letsinger et al.. Nucleoside & Nucleotide 13:1597 (1994); Chapters 2 and 3, ASC Symposium Series 580, 
"Carbohydrate Modifications in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker et al., Bioorganic 

10 & Medicinal Chem. Lett. 4:395 (1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett. 37:743 

(1996) ) and non-ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and 
Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense Research", Ed. Y.S. Sanghui 
and P. Dan Cook. Nucleic acids containing one or more carbocyclic sugars are also included within the definition of 
nucleic acids (see Jenkins et al., Chem. Soc. Rev. (1995) pp169-176). Several nucleic acid analogs are described in 

15 Rawls, C & E News June 2, 1997 page 35. All of these references are hereby expressly incorporated by reference. 

These modifications of the ribose-phosphate backbone may be done to facilitate the addition of additional moieties 
such as labels, or to increase the stability and half-life of such molecules in physiological environments. In addition, 
mixtures of naturally occurring nucleic acids and analogs can be made. Alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. The nucleic acids may be single 

20 stranded or double stranded, as specified, or contain portions of both double stranded or single stranded sequence. 

The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, where the nucleic acid contains any 
combination of deoxyribo- and ribo-nucleotides, and any combination of bases, including uracil, adenine, thymine, 
cytosine, guanine, inosine, xathanine hypoxathanine, isocytosine, isoguanine, etc. 

As described above generally for proteins, nucleic acid candidate bioactive agents may be naturally occurring 

25 nucleic acids, random nucleic acids, or "biased" random nucleic acids. For example, digests of procaryotic or 

eucaryotic genomes may be used as is outlined above for proteins. 

In a preferred embodiment, the candidate bioactive agents are organic chemical moieties, a wide variety of 
which are available in the literature. 

In a preferred embodiment, as outlined above, screens may be done on individual genes and gene products 

30 (proteins). In a preferred embodiment, the gene or protein has been identified as described below in the Examples as a 

differentially expressed gene associated with particular tissues and thus conditions related to those tissues. Thus, in 
one embodiment, screens are designed to first find candidate agents that can bind to EG-VEGF, and then these agents 
may be used in assays that evaluate the ability of the candidate agent to modulate EG-VEGF activity. Thus, as will be 
appreciated by those in the art, there are a number of different assays which may be run. 
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In one embodiment, EG-VEGF, preferably immobilized on a solid support, is contacted with a plurality of 
candidate bioactive agents and candidates that bind to EG-VEGF are selected for further study. Binding of candidate 
bioactive agents may be measured directly if the candidate bioactive agent is labelled. A candidate bioactive agent 
may preferably be labelled radioactively. Alternatively it may be labelled fluorescently. If the candidate bioactive 
5 agent is not labelled, binding may be determined indirectly based on a measured response to the binding. Alternatively, 

interaction with a candidate bioactive compound can be assessed based on the ability of the candidate bioactive 
compound to inhibit the binding of a known, labelled ligand. 

Screening for agents that modulate the activity of EG-VEGF may also be done. In a preferred embodiment, 
methods for screening for a bioactive agent capable of modulating the activity of EG-VEGF comprise the steps of 

10 adding a candidate bioactive agent to a sample of EG-VEGF and determining an alteration in the biological activity of 

EG-VEGF. "Modulating the activity of EG-VEGF" includes an increase in activity, a decrease in activity, or a change in 
the type or kind of activity present. Thus, in this embodiment, the candidate agent should both bind to EG-VEGF 
(although this may not be necessary), and alter its biological or biochemical activity as defined herein. The methods 
include both in vitro screening methods, as are generally outlined above, and in vivo screening of cells for alterations in 

1 5 the presence, distribution, activity or amount of EG-VEGF. 

Thus, in this embodiment, the methods comprise combining a sample and a candidate bioactive agent, and 
evaluating the effect on EG-VEGF activity. By "EG-VEGF protein activity" or grammatical equivalents herein is meant 
at least one of the EG-VEGF protein's biological activities, including, but not limited to cellular proliferation, 
chemotaxis/migration activity, angiogenesis, cell differentiation and cell fenestration. These activities are preferably 

20 specific in hormone producing tissues and cells, and more preferably, steroidogenic cells. Preferred cell types for 
specific activity include those of the reproductive system including cells of the ovaries, testis, prostate, uterus and 
placenta. Particular preferred cell types include the stroma and theca interna of the ovary and Leydig cells of the 
testis. Cells of the pancreas and the adrenal cortex are also preferred. An inhibitor of EG-VEGF activity is the 
inhibition of any one or more EG-VEGF protein activities. 

25 In a preferred embodiment, the activity of the EG-VEGF protein is increased; in another preferred 

embodiment, the activity of the EG-VEGF protein is decreased. Thus, bioactive agents that are antagonists are 
preferred in some embodiments, and bioactive agents that are agonists may be preferred in other embodiments. 

In one aspect of the invention, cells containing EG-VEGF sequences are used in drug screening assays by 
evaluating the effect of drug candidates on EG-VEGF. Cell type include normal cells, and more preferably cells with 

30 abnormal proliferative rates including tumor cells, most preferably human tumor cells. 

Methods of assessing EG-VEGF activity are known in the art and include growth and viability assays using 
cultured or primary cells. In such assays, cell populations are monitored for growth and or viability, often over time 
and comparing samples incubated with various concentrations of the bioactive agent or without the bioactive agent. 
Cell number can be quantified using agents that such as 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolim bromide 

35 (MTT), 3-(4,5-dimethylthiazol-2-yl)-5-(3-carboxymethoxyphenyl)-2-(4-sulfophenyl)-2H-tetrazolium (MTS) [U.S. Pat. No. 
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5,185,450] and Alamar Blue which are converted to colored or fluorescent compounds in the presence of 

metabolically active cells. Alternatively, dyes that bind to cellular protein such as sulforhodamine B (SRB) or crystal 

violet can be used to quantify cell number. Alternatively, cells can be directly counted using a particle counter, such 

as a Coulter Counter™ manufactured by Beckman Coulter, or counted using a microscope to observe cells on a 
5 hemocytometer. Preferably, cells counted using the hemocytometer are observed in a solution of trypan blue to 

distinguish viable from dead cells. Other methods of quantifying cell number are known to those skilled in the art. 

These assays can be performed on any of the cells, including those in a state of necrosis. 

A protein has chemotactic activity for a particular cell population if it can stimulate, directly or indirectly, 

the directed orientation or movement of the cell population. Preferably, the protein has the ability to directly stimulate 
10 directed movement of the cells. As described below, EG-VEGF has chemotactic activity. Changes in the chemotactic 

activity of EG-VEGF can readily be determined by employing known assays for cell chemotaxis (e.g., migration assays 

as described below in the examples). 

In a preferred embodiment, the methods comprise adding a candidate bioactive agent, as defined above, to a 

cell comprising EG-VEGF. Preferred cell types include almost any cell. The cells contain a nucleic acid, preferably 
1 5 recombinant, that encodes a EG-VEGF protein. In a preferred embodiment, a library of candidate agents are tested on a 

plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous or subsequent exposure to 
physiological signals, for example hormones, antibodies, peptides, antigens, cytokines, growth factors, action 
potentials, pharmacological agents including chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell 
20 contacts). In another example, the determinations are determined at different stages of the cell cycle process. 

The EG-VEGF sequences provided herein can also be used in methods of diagnosis. Dverexpression of EG- 
VEGF may indicate a cyst or cancer in a reproductive organ. Moreover, a sample from a patient may be analyzed for 
mutated or disfunctional EG-VEGF. Generally, such methods include comparing a sample from a patient and 
comparing EG-VEGF expression to that of a control. 

25 

F. Anti E6-VEGF Antibodies 
The present invention further provides anti-EG-VEGF antibodies. Exemplary antibodies include polyclonal, 
monoclonal, humanized, bispecific, and heteroconjugate antibodies. 

30 1. Polyclonal Antibodies 

The anti-EG-VEGF antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, by one or 
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent and/or adjuvant 
will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The immunizing agent may 

35 include the EG-VEGF polypeptide or a fusion protein thereof. It may be useful to conjugate the immunizing agent to a 
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protein known to be immunogenic in the mammal being immunized. Examples of such immunogenic proteins include 
hut are not limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. 
Examples of adjuvants which may be employed include Freund's complete adjuvant and MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). The immunization protocol may be selected by one 
5 skilled in the art without undue experimentation. 

2. Monoclonal Antibodies 
The anti-EG-VEGF antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature , 256:495 (1975). In a 

10 hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an immunizing 
agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind to the 
immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. 

The immunizing agent will typically include the EG-VEGF polypeptide or a fusion protein thereof. Generally, 
either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 

15 node cells are used if non-human mammalian sources are desired. The lymphocytes are then fused with an 

immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell [Goding, 
Monoclonal Antibodies: Principles and Practice , Academic Press, (1986) pp. 59-103]. Immortalized cell lines are 
usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, rat or 
mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture medium that 

20 preferably contains one or more substances that inhibit the growth or survival of the unfused, immortalized cells. For 
example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), 
which substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of 

25 antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 

preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk Institute 
Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, Virginia. Human 
myeloma and mouse-human heteromyeloma cell lines also have been described for the production of human monoclonal 
antibodies [Kozbor, J. Immunol. , J33:3001 (1984); Brodeur et al., Monoclonal Antibody Production Techniques and 

30 Applications , Marcel Dekker, Inc., New York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of 
monoclonal antibodies directed against EG-VEGF. Preferably, the binding specificity of monoclonal antibodies 
produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in 
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the art. The binding affinity of the monoclonal antibody can, for example, be determined by the Scatchard analysis of 
Munson and Pollard, Anal. Biochem. , 107:220 (1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution procedures 
and grown by standard methods [Goding, supra ]. Suitable culture media for this purpose include, for example, 
5 Dulbecco's Modified Eagle's Medium and RPMI-1 640 medium. Alternatively, the hybridoma cells may be grown in vivo 

as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture medium or 
ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein A-Sepharose, 
hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 
10 The monoclonal antibodies may also be made by recombinant DNA methods, such as those described in U.S. 

Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated and 
sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of binding specifically 
to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a 
preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are then 
15 transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not 
otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the recombinant host 
cells. The DNA also may be modified, for example, by substituting the coding sequence for human heavy and light 
chain constant domains in place of the homologous murine sequences [U.S. Patent No. 4,816,567; Morrison et al., 
supra) or by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a non- 
20 immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant domains of 
an antibody of the invention, or can be substituted for the variable domains of one antigen-combining site of an 
antibody of the invention to create a chimeric bivalent antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are well known 
in the art. For example, one method involves recombinant expression of immunoglobulin light chain and modified heavy 
25 chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent heavy chain crosslinking. 

Alternatively, the relevant cysteine residues are substituted with another amino acid residue or are deleted so as to 
prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to produce 
fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known in the art. 

30 

3. Human and Humanized Antibodies 
The anti-EG-VEGF antibodies of the invention may further comprise humanized antibodies or human 
antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, immunoglobulin 
chains or fragments thereof (such as Fv, Fab, Fab\ F(ab") z or other antigen-binding subsequences of antibodies) which 
35 contain minimal sequence derived from non-human immunoglobulin. Humanized antibodies include human 
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immunoglobulins (recipient antibody) in which residues from a complementary determining region (CDR) of the recipient 
are replaced by residues from a CDR of a non-human species (donor antibody) such as mouse, rat or rabbit having the 
desired specificity, affinity and capacity. In some instances, Fv framework residues of the human immunoglobulin are 
replaced by corresponding non-human residues. Humanized antibodies may also comprise residues which are found 
5 neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the humanized antibody 

will comprise substantially all of at least one, and typically two, variable domains, in which all or substantially all of 
the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are 
those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least a 
portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones et al.. Mature , 
10 321:522-525 (1986); Riechmann et al., Nature , 332:323-329 (1988); and Presta, Curr. Op. Struct. Biol. , 2:593-596 

(1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized antibody 
has one or more amino acid residues introduced into it from a source which is non-human. These non-human amino 
acid residues are often referred to as "import" residues, which are typically taken from an "import" variable domain. 

15 Humanization can be essentially performed following the method of Winter and co-workers [Jones et al.. Nature , 

321:522-525 (1986); Riechmann et al., Nature , 332:323-327 (1988); Verhoeyen et al.. Science , 239:1534-1536 
(1988)], by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. 
Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially 
less than an intact human variable domain has been substituted by the corresponding sequence from a non-human 

20 species. In practice, humanized antibodies are typically human antibodies in which some CDR residues and possibly 
some FR residues are substituted by residues from analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art, including phage display 
libraries [Hoogenboom and Winter, J. Mol. Biol. , 227:381 (1991); Marks et al., J. Mol. Biol. , 222:581 (1991)]. The 
techniques of Cole et al. and Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole 

25 et al., Monoclonal Antibodies and Cancer Therapy , Alan R. Liss, p. 77 (1985) and Boerner et al., J. Immunol. , 

147(11 :86-95 (1991)]. Similarly, human antibodies can be made by introducing of human immunoglobulin loci into 
transgenic animals, e.g., mice in which the endogenous immunoglobulin genes have been partially or completely 
inactivated. Upon challenge, human antibody production is observed, which closely resembles that seen in humans in 
all respects, including gene rearrangement, assembly, and antibody repertoire. This approach is described, for 

30 example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the 

following scientific publications: Marks et al., Bio/Technology 10, 779-783 (1992); Lonberg et al.. Nature 368 856- 
859 (1994); Morrison, Nature 368, 812-13 (1994); Fishwild et al., Nature Biotechnology 14, 845-51 (1996); 
Neuberger, Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intern. Rev. Immunol. 13 65-93 (1995). 
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4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities is for the EG- 
VEGF, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor subunit. 
5 Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant production of 

bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain pairs, where the two 
heavy chains have different specificities [Milstein and Cuello, Nature , 305:537-539 (1983)]. Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a potential mixture of 
ten different antibody molecules, of which only one has the correct bispecific structure. The purification of the 

10 correct molecule is usually accomplished by affinity chromatography steps. Similar procedures are disclosed in WO 
93/08829, published 13 May 1993, and in Traunecker et al., EMBOJ. , 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can be 
fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin heavy-chain 
constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy- 

15 chain constant region (CH1) containing the site necessary for light-chain binding present in at least one of the fusions. 

DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin light chain, are inserted 
into separate expression vectors, and are co-transfected into a suitable host organism. For further details of 
generating bispecific antibodies see, for example, Suresh et al.. Methods in Enzvmolonv , 121:210 (1986). 

According to another approach described in WO 96/27011, the interface between a pair of antibody 

20 molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant cell 
culture. The preferred interface comprises at least a part of the CH3 region of an antibody constant domain. In this 
method, one or more small amino acid side chains from the interface of the first antibody molecule are replaced with 
larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing large amino acid side chains with 

25 smaller ones (e.g. alanine or threonine). This provides a mechanism for increasing the yield of the heterodimer over 

other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F(ab ) 2 bispecific 
antibodies). Techniques for generating bispecific antibodies from antibody fragments have been described in the 
literature. For example, bispecific antibodies can be prepared can be prepared using chemical linkage. Brennan et al, 

30 Science 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to generate F(ab') 2 

fragments. These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to stabilize 
vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments generated are then converted to 
thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction 
with mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB derivative to form the 
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bispecific antibody. The bispecific antibodies produced can be used as agents for the selective immobilization of 
enzymes. 

Fab' fragments may be directly recovered from£ coli and chemically coupled to form bispecific antibodies. 
Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe the production of a fully humanized bispecific antibody 
5 F(ab') 2 molecule. Each Fab' fragment was separately secreted from £ coli and subjected to directed chemical coupling 

in vitro to form the bispecific antibody. The bispecific antibody thus formed was able to bind to cells overexpressing 
the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity of human cytotoxic lymphocytes 
against human breast tumor targets. 

Various technique for making and isolating bispecific antibody fragments directly from recombinant cell 

10 culture have also been described. For example, bispecific antibodies have been produced using leucine zippers. 

Kostelny et al., J. Immunol. 148(5):1547-1553 (1992). The leucine zipper peptides from the Fos and Jun proteins 
were linked to the Fab' portions of two different antibodies by gene fusion. The antibody homodimers were reduced 
at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. This method can also 
be utilized for the production of antibody homodimers. The "diabody" technology described by Hollinger et al., Proc. 

15 Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative mechanism for making bispecific antibody 

fragments. The fragments comprise a heavy-chain variable domain (V H ) connected to a light-chain variable domain (V L ) 
by a linker which is too short to allow pairing between the two domains on the same chain. Accordingly, the \ and V L 
domains of one fragment are forced to pair with the complementary V L and V H domains of another fragment, thereby 
forming two antigen-binding sites. Another strategy for making bispecific antibody fragments by the use of 

20 single-chain Fv (sFv) dimers has also been reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be prepared. Tutt 
eta/.. J. Immunol. 147:60 (1991). 

Exemplary bispecific antibodies may bind to two different epitopes on a given EG-VEGF polypeptide herein. 
Alternatively, an anti-EG-VEGF polypeptide arm may be combined with an arm which binds to a triggering molecule on 

25 a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for IgG (Fc R), such as 

Fc Rl (CD64), Fc Rll (CD32) and Fc Rill (CD 16) so as to focus cellular defense mechanisms to the cell expressing the 
particular EG-VEGF polypeptide. Bispecific antibodies may also be used to localize cytotoxic agents to cells which 
express a particular EG-VEGF polypeptide. These antibodies possess a EG-VEGF-binding arm and an arm which binds a 
cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody of 

30 interest binds the EG-VEGF polypeptide and further binds tissue factor (TF). 

5. Heteroconjugate Antibodies 
Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate antibodies 
are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed to target 
35 immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of HIV infection [WO 
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91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared in vitro using known 
methods in synthetic protein chemistry, including those involving crosslinking agents. For example, immunotoxins may 
be constructed using a disulfide exchange reaction or by forming a thioether bond. Examples of suitable reagents for 
this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent 
5 No. 4,676,980. 

6. Effector Function Engineering 

It may be desirable to modify the antibody of the invention with respect to effector function, so as to 
enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may be introduced 

10 into the Fc region, thereby allowing interchain disulfide bond formation in this region. The homodimerie antibody thus 

generated may have improved internalization capability and/or increased complement-mediated cell killing and 
antibody-dependent cellular cytotoxicity (ADCC). See Caron eta/., J. Exp Med ., 176 : 1 191-1 195 (1992) and Shopes, 
J. Immunol .. 148 : 2918-2922 (1992). Homodimerie antibodies with enhanced anti-tumor activity may also be 
prepared using heterobifunctional cross-linkers as described in Wolff etai. Cancer Research , 53: 2560-2565 (1993). 

15 Alternatively, an antibody can be engineered that has dual Fc regions and may thereby have enhanced complement 

lysis and ADCC capabilities. See Stevenson eta/., Anti-Cancer Drug Design, 3: 219-230 (1989). 

7. Immunoconiugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent 

20 such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of bacterial, fungal, plant, or animal 

origin, or fragments thereof), or a radioactive isotope {i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above. 
Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active 
fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin 

25 A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 

PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, 
phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are available for the production of 
radioconjugated antibodies. Examples include 212 Bi, 131 l, 131 ln, 90 Y, and ,86 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of Afunctional protein-coupling 

30 agents such as l\l-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional derivatives of 

imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as 
bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis-active fluorine 
compounds (such as 1,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described 

35 in Vitetta et a/., Science, 238 : 1098 (1987). Carbon-14-labeled 1-isothiocyanatobenzyl-3-methyldiethylene 
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triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the 
antibody. See W094/1 1026. 

In another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for utilization in 
tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed by removal of 
5 unbound conjugate from the circulation using a clearing agent and then administration of a "ligand" [e.g., avidin) that 

is conjugated to a cytotoxic agent (e.g., a radionucleotide). 

8. Immunoliposomes 

The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing the 
10 antibody are prepared by methods known in the art, such as described in Epstein et al., Proc. Natl. Acad. Sci. USA , 

82: 3688 (1985); Hwang et al., Proc. Natl Acad. Sci. USA, 77: 4030 (1980); and U.S. Pat. Nos. 4,485,045 and 
4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can he generated by the reverse-phase evaporation method with a lipid 
composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized phosphatidylethanolamine (PEG-PE). 
15 Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. Fab' 

fragments of the antibody of the present invention can be conjugated to the liposomes as described in Martin etal.,J± 
Biol. Chem. , 257 : 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic agent (such as 
Doxorubicin) is optionally contained within the liposome. See Gabizon et al., J. National Cancer Inst. , 81(19): 1484 
(1989). 

20 

9. Pharmaceutical Compositions of Antibodies 

Antibodies specifically binding a EG-VEGF polypeptide identified herein, as well as other molecules identified 
by the screening assays disclosed hereinbefore, can be administered for the treatment of various disorders in the form 
of pharmaceutical compositions. 

25 If the EG-VEGF polypeptide is targeted intracellular^ and whole antibodies are used as inhibitors, 

internalizing antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or 
an antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that 
specifically binds to the binding domain of the target protein is preferred. For example, based upon the variable-region 
sequences of an antibody, peptide molecules can be designed that retain the ability to bind the target protein 

30 sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA technology. See, e.g., 

Marasco et al., Proc. Natl. Acad. Sci. USA , 90: 7889-7893 (1 993). The formulation herein may also contain more than 
one active compound as necessary for the particular indication being treated, preferably those with complementary 
activities that do not adversely affect each other. Alternatively, or in addition, the composition may comprise an 
agent that enhances its function, such as, for example, a cytotoxic agent, cytokine, chemotherapeutic agent, or 
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growth-inhibitory agent. Such molecules are suitably present in combination in amounts that are effective for the 
purpose intended. 

The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 
techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and poly- 
5 (methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, albumin 

microspheres, microemulsions, nano-particles, and nanocapsules) or in macroemulsions. Such techniques are disclosed 
in Remington's Pharmaceutical Sciences , supra. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes. 

10 Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 

include semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in the form 
of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include polyesters, hydrogels 
(for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides (U.S. Pat. No. 3,773,919), 
copolymers of L-glutamic acid and ethyl-L-glutamate, non-degradable ethylene-vinyl acetate, degradable lactic acid- 

15 glycolic acid copolymers such as the LUPRON DEPOT ™ (injectable microspheres composed of lactic acid-glycolic acid 

copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. While polymers such as ethylene-vinyl acetate 
and lactic acid-glycolic acid enable release of molecules for over 100 days, certain hydrogels release proteins for 
shorter time periods. When encapsulated antibodies remain in the body for a long time, they may denature or 
aggregate as a result of exposure to moisture at 37 D C, resulting in a loss of biological activity and possible changes in 

20 immunogenicity. Rational strategies can be devised for stabilization depending on the mechanism involved. For 
example, if the aggregation mechanism is discovered to be intermolecular S-S bond formation through thio-disulfide 
interchange, stabilization may be achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions, 
controlling moisture content, using appropriate additives, and developing specific polymer matrix compositions. 
G. Uses for anti-EG-VEGF Antibodies 

25 The anti-EG-VEGF antibodies of the invention have various utilities. For example, anti-EG-VEGF antibodies 

may be used in diagnostic assays for EG-VEGF, e.g„ detecting its expression in specific cells, tissues, or serum. 
Various diagnostic assay techniques known in the art may be used, such as competitive binding assays, direct or 
indirect sandwich assays and immunoprecipitation assays conducted in either heterogeneous or homogeneous phases 
[Zola, Monoclonal Antibodies: A Manual of Techniques , CRC Press, Inc. (1987) pp. 147-158]. The antibodies used in 

30 the diagnostic assays can he labeled with a detectable moiety. The detectable moiety should be capable of producing, 

either directly or indirectly, a detectable signal. For example, the detectable moiety may be a radioisotope, such as 3 H, 
,4 C, 32 P, 35 S, or 125 l, a fluorescent or chemiluminescent compound, such as fluorescein isothiocyanate, rhodamine, or 
luciferin, or an enzyme, such as alkaline phosphatase, beta-galactosidase or horseradish peroxidase. Any method 
known in the art for conjugating the antibody to the detectable moiety may be employed, including those methods 
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described by Hunter et al., Nature , 144:945 (1962); David et al., Biochemistry , 13:1014 (1974); Pain et al., 1 
Immunol. Meth. , 40:219 (1981); and Nygren, J. Histochem. and Cvtochem., 30:407 (1982). 

Anti-EG-VEGF antibodies also are useful for the affinity purification of EG-VEGF from recombinant cell 
culture or natural sources. In this process, the antibodies against EG-VEGF are immobilized on a suitable support, such 
5 a Sephadex resin or filter paper, using methods well known in the art. The immobilized antibody then is contacted 

with a sample containing the EG-VEGF to be purified, and thereafter the support is washed with a suitable solvent that 
will remove substantially all the material in the sample except the EG-VEGF, which is bound to the immobilized 
antibody. Finally, the support is washed with another suitable solvent that will release the EG-VEGF from the 
antibody, in addition, anti-EG-VEGF antibodies are useful as therapeutic and diagnostic agent, and can be used for the 
10 treatment and/or diagnosis of the conditions discussed before in connection with EG-VEGF and EG-VEGF antagonists. 

The following examples are offered for illustrative purposes only, and are not intended to limit the scope of 
the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by reference in 
their entirety. 

15 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's 
instructions unless otherwise indicated. The source of those cells identified in the following examples, and throughout 
the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, VA. 

20 

EXAMPLE 1 

Isolation of cDNA Clones Encoding a Human EG-VEGF 
DNA60621-1516 was identified by applying a proprietary signal sequence finding algorithm (Figure 20A-Q) 
developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as clustered and assembled EST fragments 
25 from public (e.g., GenBank) and/or private (LIFESEQ™, Incyte Pharmaceuticals, Inc., Palo Alto, CA) databases. The 

signal sequence algorithm computes a secretion signal score based on the character of the DMA nucleotides 
surrounding the first and optionally the second methionine codon(s) (ATG) at the 5'-end of the sequence or sequence 
fragment under consideration. The nucleotides following the first ATG must code for at least 35 unambiguous amino 
acids without any stop codons. If the first ATG has the required amino acids, the second is not examined. If neither 
30 meets the requirement, the candidate sequence is not scored. In order to determine whether the EST sequence 

contains an authentic signal sequence, the DI\IA and corresponding amino acid sequences surrounding the ATG codon 
are scored using a set of seven sensors (evaluation parameters) known to be associated with secretion signals. 

Use of the above described signal sequence algorithm allowed identification of an EST cluster sequence from 
the LIFESEQ™ database, Incyte Pharmaceuticals, Palo Alto, database, designated herein as DNA1 57032. This EST 
35 cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
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EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, 
CA) to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmoloav 266:460-480 (1996)]. Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
5 consensus DMA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 

consensus sequence obtained therefrom is shown in Figure 4 (SEQ ID N0:3), wherein the consensus sequence is herein 
designated DNA56748. 

In light of an observed sequence homology between the DMA56748 sequence and an EST sequence 
encompassed within clone no. 3476792 from the Incyte database, done no. 3476792 was purchased and the cDNA 

10 insert was obtained and sequenced. Clone no. 3476792 was isolated from a library from ovarian tissue. Sectioning 
of the tissue found the posterior serosa contained a focus of endometriosis. Pathology for the associated tumor 
tissue indicated multiple leiomyomata, ranging in size. It was found herein that that cDNA insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 1 and is herein designated as DNA60621-1516. 

Clone DWA6Q621 -1 51 6 contains a single open reading frame with an apparent translational initiation site at 

15 nucleotide positions 91-93 and ending at the stop codon at nucleotide positions 406-408 (Figure 1). The predicted 

polypeptide precursor is 105 amino acids long (Figure 2). The full-length EG-VEGF protein shown in Figure 2 has an 
estimated molecular weight of about 11715 daltons and a pi of about 9.05. Mature EG-VEGF is an 8600 dalton 
protein encoded by a cDNA cloned from a human ovary library. The 1.4 kilobase cDNA encodes a protein of 105 
amino acids with a well defined signal sequence. EG-VEGF is a cysteine-rich protein comprised of a colipase fold 

20 motif. Of the 86 amino acids expected in the mature protein, 10 are cysteines (Figures 15a and b). The protein has a 

series of short beta strands with large connecting loops which are held together by disulfide bonds resulting in a flat 
fold with finger-like projections that act as interactive surfaces. Analysis of the full-length EG-VEGF sequence shown 
in Figure 2 (SEO ID l\IO:2) evidences the presence of a variety of important polypeptide domains as shown in Figure 2, 
wherein the locations given for those important polypeptide domains are approximate as described above. Clone 

25 DNA6062M516 has been deposited with ATCC on August 4, 1998 and is assigned ATCC Deposit No. 203091. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using the ALIGN-2 sequence alignment 
analysis of the full-length sequence shown in Figure 2 (SEQ ID I\I0:2), evidenced sequence identity between the EG- 
VEGF amino acid sequence and the following Dayhoff sequences: VRPA_DENP0, LFE4_CHICK, AF034208J, 
AF030433J, A55035, COLRABIT, CELB0507 9, S67826J, S34665 and CRU73817J. 

30 Remarkably, EG-VEGF displays a high degree of homology (80%) and identity (63%) to a non-toxic protein 

purified from the venom of black mamba snake Dendroaspis polylepis polylepis, and named venom protein A (VPRA) 
[Joubert and Strydom, Hoppe-Sevlers Zeitschrift fur Phvs. Chemie , 361:1787-1794 (1980)] and a high degree of 
homology (76%) and identity (58%) to a human molecule closely related to a peptide isolated UmBombina variegata, 
and termed "Bv8" [Wechselberger et al., FEBS Lett., 462:177-181 (1999)]. Figure 15b illustrates this homology. In 

35 Figure 15b, boxed residues indicate identity and human EG-VEGF, snake VPRA, and human BV8 homologue (Bv8 horn) 
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amino acid sequences are indicated. Notably, the number and spacing of cysteines for EG-VEGF are completely 
conserved. Thus, EG-VEGF is the human orthologue or a closely related homologue of VPRA and Bv8. EG-VEGF 
diplays a more limited but significant homology with the cysteine-rich carboxyl sequence of the Xenopus head 
organizer dickkopf, an inhibitor of wnt signaling, and to colipase, as can be seen in Figure 1 5c [Glinka et al.. Mature , 
5 391:357-362; Aravind and Koonin, Curr. Biology , 8:477-478 (1 998)]. Figure 1 5c is an alignment of human EG-VEGF, 

human dickkopf-3 (hdkk3) [Kmpnik et al.. Gene, 238:301-313 (1999)], Xenopus dkk-1 (xdkkl) [Glinka et al.. Nature, 
39J.:357-362 (1998)] and porcine colipase (col). This illustrates the conserved cysteines which form the 
characteristic disulfide-bonding pattern of the colipase fold domain [van Tilbeurgh et al.. Nature , 359:159-162 
(1992)]. This motif in EG-VEGF is 37% identical and 41% homologous to the cysteine rich C-terminal domain of 
10 human dkk-3 and 32% identical, 42% homologous to the Xenopus dkk-1 domain. Numbers indicate amino acid 
position in the respective protein and boxed residues are identical to EG-VEGF. 



EXAMPLE 2 
Use of EG-VEGF as a hybridization probe 
15 The following method describes use of a nucleotide sequence encoding EG-VEGF as a hybridization probe. 

DNA comprising the coding sequence of full-length or mature EG-VEGF is employed as a probe to screen for 
homologous DNAs (such as those encoding naturally-occurring variants of EG-VEGF) in human tissue cDNA libraries or 
human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following high 
20 stringency conditions. Hybridization of radiolabeled EG-VEGF-derived probe to the filters is performed in a solution of 

50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM sodium phosphate, pH 6.8, 2x Denhardt's 
solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed in an aqueous solution of 
O.lxSSC and 0.1% SDS at42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native sequence EG-VEGF can 
25 then be identified using standard techniques known in the art. 
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EXAMPLE 3 
Expression of EG-VEGF in £ coli 
This example illustrates preparation of an ungiycosylated form of EG-VEGF by recombinant expression in £ 

coli. 

5 The DNA sequence encoding EG-VEGF is initially amplified using selected PCR primers. The primers should 

contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected expression vector. A 
variety of expression vectors may be employed. An example of a suitable vector is pBR322 (derived from £ coli; see 
Bolivar et al., Gene, 2:95 (1977)) which contains genes for ampicillin and tetracycline resistance. The vector is 
digested with restriction enzyme and dephosphorylated. The PCR amplified sequences are then ligated into the vector. 

10 The vector will preferably include sequences which encode for an antibiotic resistance gene, a trp promoter, a polyhis 
leader (including the first six STII codons, polyhis sequence, and enterokinase cleavage site), the EG-VEGF coding 
region, lambda transcriptional terminator, and an argU gene. 

The ligation mixture is then used to transform a selected £ coli strain using the methods described in 
Sambrook et al., supra . Transformants are identified by their ability to grow on LB plates and antibiotic resistant 

15 colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are then 
grown to a desired optical density, during which the expression promoter is turned on. 

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell pellet 

20 obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized EG-VEGF 

protein can then be purified using a metal chelating column under conditions that allow tight binding of the protein. 

EG-VEGF may be expressed in £ coli in a poly-His tagged form, using the following procedure. The DMA 
encoding EG-VEGF is initially amplified using selected PCR primers. The primers will contain restriction enzyme sites 
which correspond to the restriction enzyme sites on the selected expression vector, and other useful sequences 

25 providing for efficient and reliable translation initiation, rapid purification on a metal chelation column, and proteolytic 

removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then ligated into an expression vector, 
which is used to transform an £ «7//host based on strain 52 (W3110 fuhA(tonA) Ion galE rpoHts(htpRts) clpP(laclq). 
Transformants are first grown in LB containing 50 mgfml carbenicillin at 30 D C with shaking until an O.D.600 of 3-5 is 
reached. Cultures are then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium 

30 citrate 2H20, 1 .07 g KCI, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 1 10 

mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgS0 4 ) and grown for approximately 20-30 hours at 30°C with 
shaking. Samples are removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet 
the cells. Cell pellets are frozen until purification and refolding. 

£ coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w|v) in 7 M 

35 guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 



WO 02/00711 



PCT/US01/20116 



concentrations of 0.1 M and 0.02 M, respectively, and the solution is stirred overnight at 4 D C. This step results in a 
denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000 rpm in a 
Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate column buffer (6 M 
guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The clarified extract is loaded onto 
5 a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate column buffer. The column is washed 

with additional buffer containing 50 mM imidazole (Calbiochem, Utrol grade), pH 7.4. The protein is eluted with buffer 
containing 250 mM imidazole. Fractions containing the desired protein are pooled and stored at 4°C. Protein 
concentration is estimated by its absorbance at 280 nm using the calculated extinction coefficient based on its amino 
acid sequence. 

10 The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting of: 

20 mM Tris, pH 8.6, 0.3 M MaCI, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. Refolding volumes 
are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The refolding solution is 
stirred gently at 4 G C for 12-36 hours. The refolding reaction is quenched by the addition of TFA to a final 
concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the solution is filtered 

15 through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The refolded protein is 

chromatographed on a Poros R1/H reversed phase column using a mobile buffer of 0.1% TFA with elution with a 
gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance are analyzed on SDS 
polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. Generally, the properly 
refolded species of most proteins are eluted at the lowest concentrations of acetonitrile since those species are the 

20 most compact with their hydrophobic interiors shielded from interaction with the reversed phase resin. Aggregated 

species are usually eluted at higher acetonitrile concentrations. In addition to resolving misfolded forms of proteins 
from the desired form, the reversed phase step also removes endotoxin from the samples. 

Fractions containing the desired folded EG-VEGF polypeptide are pooled and the acetonitrile removed using a 
gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 0.14 M 

25 sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins equilibrated in 

the formulation buffer and sterile filtered. 

EXAMPLE 4 
Expression of EG-VEGF in mammalian cells 
30 This example illustrates preparation of a potentially glycosylated form of EG-VEGF by recombinant 

expression in mammalian cells. 

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 
Optionally, the EG-VEGF DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the EG-VEGF 
DNA using ligation methods such as described in Sambrook et al., supra . The resulting vector is called pRK5-EG-VEGF. 



-58- 



WO 02/00711 



PCT/US01/20116 



In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are grown 
to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and optionally, 
nutrient components and/or antibiotics. About 10 g pRK5-EG-VEGF DNA is mixed with about 1 g DNA encoding the 
VA RNA gene [Thimmappaya et al, CeH, 31:543 (1982)] and dissolved in 500 I of 1 mM Tris-HCI, 0.1 mM EDTA, 
5 0.227 M CaCI 2 . To this mixture is added, dropwise, 500 I of 50 mM HEPES (pH 7.35), 280 mM NaCI, 1.5 mM 

NaP0 4 , and a precipitate is allowed to form for 10 minutes at 25 C C. The precipitate is suspended and added to the 
293 cells and allowed to settle for about four hours at 37°C. The culture medium is aspirated off and 2 ml of 20% 
glycerol in PBS is added for 30 seconds. The 293 cells are then washed with serum free medium, fresh medium is 
added and the cells are incubated for about 5 days. 

10 Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture 

medium (alone) or culture medium containing 200 Ci/ml 35 S-cysteine and 200 Ci/ml 35 S-methionine. After a 12 hour 
incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 1 5% SDS gel. The 
processed gel may be dried and exposed to film for a selected period of time to reveal the presence of EG-VEGF 
polypeptide. The cultures containing transfected cells may undergo further incubation (in serum free medium) and the 

1 5 medium is tested in selected bioassays. 

In an alternative technique, EG-VEGF may be introduced into 293 cells transiently using the dextran sulfate 
method described by Somparyrac et al., Proc. Natl. Acad. Sci„ 12:7575 (1981). 293 cells are grown to maximal 
density in a spinner flask and 700 g pRK5-EG-VEGF DMA is added. The cells are first concentrated from the spinner 
flask by centrifugation and washed with PBS. The DIMA-dextran precipitate is incubated on the cell pellet for four 

20 hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture medium, and re-introduced 
into the spinner flask containing tissue culture medium, 5 g/ml bovine insulin and 0.1 g/ml bovine transferrin. After 
about four days, the conditioned media is centrifuged and filtered to remove cells and debris. The sample containing 
expressed EG-VEGF can then be concentrated and purified by any selected method, such as dialysis and/or column 
chromatography. 

25 In another embodiment, EG-VEGF can be expressed in CHO cells. The pRK5-EG-VEGF can be transfected into 

CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the cell cultures can be 
incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as 35 S- 
methionine. After determining the presence of EG-VEGF polypeptide, the culture medium may be replaced with serum 
free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium is harvested. 

30 The medium containing the expressed EG-VEGF can then be concentrated and purified by any selected method. 

Epitope-tagged EG-VEGF may also be expressed in host CHO cells. The EG-VEGF may be subcloned out of 
the pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly-his 
tag into a Baculovirus expression vector. The poly-his tagged EG-VEGF insert can then be subcloned into a SV40 
driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells can be 

35 transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described above, to 
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verify expression. The culture medium containing the expressed poly-His tagged EG-VEGF can then be concentrated 
and purified by any selected method, such as by l\li 2+ -chelate affinity chromatography. 

EG-VEGF may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO cells 
by another stable expression procedure. 
5 Stable expression in CHO cells is performed using the following procedure. The proteins are expressed as an 

IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular domains) of the 
respective proteins are fused to an lgG1 constant region sequence containing the hinge, CH2 and CH2 domains and/or 
is a poly-His tagged form. 

Following PCR amplification, the respective DMAs are subcloned in a CHO expression vector using standard 

10 techniques as described in Ausuhel et al.. Current Protocols of Molecular Biology , Unit 3.16, John Wiley and Sons 

(1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' of the DIMA of interest 
to allow the convenient shuttling of cDNA's. The vector used expression in CHO cells is as described in Lucas et al., 
Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early promoter/enhancer to drive expression of the 
cDNA of interest and dihydrofolate reductase (DHFR). DHFR expression permits selection for stable maintenance of 

1 5 the plasmid following transf ection. 

Twelve micrograms of the desired plasmid DMA is introduced into approximately 10 million CHO cells using 
commercially available transfection reagents Superfect '" (Quiagen), Oosper™ or Fugene™ (Boehringer Mannheim). The 
cells are grown as described in Lucas et al., supra . Approximately 3 x 10 7 cells are frozen in an ampule for further 
growth and production as described below. 

20 The ampules containing the plasmid DIM are thawed by placement into water bath and mixed by vortexing. 

The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 rpm for 5 
minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 m filtered PS20 
with 5% 0.2 m diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL spinner containing 90 mL 
of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner filled with 1 50 mL selective growth 

25 medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL and 2000 mL spinners are seeded with 3 x 

10 5 cells/mL. The cell media is exchanged with fresh media by centrifugation and resuspension in production medium. 
Although any suitable CHO media may be employed, a production medium described in U.S. Patent No. 5,122,469, 
issued June 16, 1992 may actually be used. A 3L production spinner is seeded at 1.2 x 10 e cells/mL On day 0, the 
cell number pH ie determined. On day 1, the spinner is sampled and sparging with filtered air is commenced. On day 

30 2, the spinner is sampled, the temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% 

antifoam (e.g., 35% polydimethylsiloxane emulsion, Dow Coming 365 Medical Grade Emulsion) taken. Throughout the 
production, the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped 
below 70%, the cell culture is harvested by centrifugation and filtering through a 0.22 m filter. The filtrate was 
either stored at 4°C or immediately loaded onto columns for purification. 
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For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 
purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media is 
pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCI and 5 mM 
imidazole at a flow rate of 4-5 ml/min. at 4"C. After loading, the column is washed with additional equilibration buffer 
5 and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly purified protein is 

subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCI and 4% mannitol, pH 6.8, with a 
25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 mM Na 
10 phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer before elution 
with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 ml fractions into tubes 
containing 275 L of 1 M Tris buffer, pH 9. The highly purified protein is subsequently desalted into storage buffer as 
described above for the poly-His tagged proteins. The homogeneity is assessed by SDS polyacrylamide gels and by N- 
terminal amino acid sequencing by Edman degradation. 

15 

EXAMPLE 5 
Expression of EG-VEGF in Yeast 
The following method describes recombinant expression of EG-VEGF in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of EG-VEGF from the 
20 ADH2/GAPDH promoter. DNA encoding EG-VEGF and the promoter is inserted into suitable restriction enzyme sites in 
the selected plasmid to direct intracellular expression of EG-VEGF. For secretion, DNA encoding EG-VEGF can be 
cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native EG-VEGF signal 
peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory signal/leader 
sequence, and linker sequences (if needed) for expression of EG-VEGF. 
25 Yeast cells, such as yeast strain AB110, can then be transformed with the expression plasmids described 

above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by 
precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
Coomassie Blue stain. 

Recombinant EG-VEGF can subsequently be isolated and purified by removing the yeast cells from the 
30 fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 

concentrate containing EG-VEGF may further be purified using selected column chromatography resins. 



EXAMPLE 6 

Expression of EG-VEGF in Baculovirus-lnfected Insect Cells 
35 The following method describes recombinant expression of EG-VEGF in Baculovirus-infected insect cells. 
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The sequence coding for EG-VEGF is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG). A variety 
of plasmids may be employed, including plasmids derived from commercially available plasmids such as pVL1393 
(PharMingen). Briefly, the sequence encoding EG-VEGF or the desired portion of the coding sequence of EG-VEGF such 
5 as the sequence encoding the extracellular domain of a transmembrane protein or the sequence encoding the mature 

protein if the protein is extracellular is amplified by PCR with primers complementary to the 5' and 3' regions. The 5' 
primer may incorporate flanking (selected) restriction enzyme sites. The product is then digested with those selected 
restriction enzymes and suhcloned into the expression vector. 

For example, the coding sequences of human EG-VEGF were amplified by PCR and subcloned into the EcoRI 

10 and Stul sites of pBPH.IgG to generate a C-terminal fusion with the Fc region of human lgG1 or into the EcoRI and 

Smal sites of pBPH.His.c to generate a C-terminal GHHHHHHHH tag. Vectors pBPH.IgG and pBPH.His.c are 
derivatives of the baculovirus expression vector PVL1393 (PharMingen). 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus DNA 
(Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin (commercially available from 

15 GIBCO-BRL). After 2-4 days of incubation at 28"C, the released viruses are harvested and used for further 

amplifications. Viral infection and protein expression are performed as described by O'Reilley et al., Baculovirus 
expression vectors: A Laboratory Manual , Oxford: Oxford University Press (1994). 

Expressed poly-his tagged EG-VEGF can then be purified, for example, by either protein A-Sepharose beads 
(Pharmacia) for Fc fusion proteins or Ni-NTA agarose heads (Qiagen) for His-tagged proteins. For His-tagged proteins, 

20 extracts are prepared from recombinant virus-infected Sf9 cells as described by Rupert et al.. Nature , 362:175-179 

(1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 mL Hepes, pH 7.9; 12.5 mM MgCI 2 ; 0.1 
mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KCI), and sonicated twice for 20 seconds on ice. The sonicates are 
cleared by centrifugation, and the supernatant is diluted 50-fold in loading buffer (50 mM phosphate, 300 mM NaCI, 
10% glycerol, pH 7.8) and filtered through a 0.45 m filter. A Ni 2+ -NTA agarose column (commercially available from 

25 Qiagen) is prepared with a bed volume of 5 mL, washed with 25 mL of water and equilibrated with 25 mL of loading 

buffer. The filtered cell extract is loaded onto the column at 0.5 mL per minute. The column is washed to baseline 
A 28 o with loading buffer, at which point fraction collection is started. Next, the column is washed with a secondary 
wash buffer (50 mM phosphate; 300 mM NaCI, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. 
After reaching A 280 baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary 

30 wash buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with Ni 2+ - 

NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His 10 -tagged EG-VEGF are pooled 
and dialyzed against loading buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) EG-VEGF can be performed using known 
chromatography techniques, including for instance. Protein A or protein G column chromatography. 
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To examine protein expression, SDS/PAGE analysis was performed on the affinity-purified, recombinant 
proteins under non-reducing and reducing conditions, followed by staining. Proteins were routinely submitted for l\l- 
terminal sequencing and endotoxin measurements, which were below 1 eu/mg. 

5 EXAMPLE 7 

Preparation of Antibodies that Bind EG-VEGF 
This example illustrates preparation of monoclonal antibodies which can specifically bind EG-VEGF. 
Techniques for producing the monoclonal antibodies are known in the art and are described, for instance, in 
Goding, supra . Immunogens that may be employed include purified EG-VEGF, fusion proteins containing EG-VEGF, and 
10 cells expressing recombinant EG-VEGF on the cell surface. Selection of the immunogen can be made by the skilled 

artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the EG-VEGF immunogen emulsified in complete Freund's adjuvant 
and injected suhcutaneously or intraperitoneal^ in an amount from 1-100 micrograms. Alternatively, the immunogen 
is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and injected into the animal's hind 
15 foot pads. The immunized mice are then boosted 10 to 12 days later with additional immunogen emulsified in the 

selected adjuvant. Thereafter, for several weeks, the mice may also be boosted with additional immunization 
injections. Serum samples may be periodically obtained from the mice by retro-orbital bleeding for testing in ELISA 
assays to detect anti-EG-VEGF antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected with 
20 a final intravenous injection of EG-VEGF. Three to four days later, the mice are sacrificed and the spleen cells are 
harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma cell line 
such as P3X63Agll.1, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells which can then be 
plated in 96 well tissue culture-plates containing HAT (hypoxanthine, aminopterin, and thymidine) medium to inhibit 
proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 
25 The hybridoma cells will be screened in an ELISA for reactivity against EG-VEGF. Determination of 

"positive" hybridoma cells secreting the desired monoclonal antibodies against EG-VEGF is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce ascites 
containing the anti-EG-VEGF monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue culture 
flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be accomplished using 
30 ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, affinity chromatography 

based upon binding of antibody to protein A or protein G can be employed. 
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EXAMPLE 8 

Purification of EG-VEGF Polypeptides Using Specific Antibodies 
Native or recombinant EG-VEGF polypeptides may be purified by a variety of standard techniques in the art 
of protein purification. For example, pro-EG-VEGF polypeptide, mature EG-VEGF polypeptide, or pre-EG-VEGF 
5 polypeptide is purified by immunoaffinity chromatography using antibodies specific for the EG-VEGF polypeptide of 

interest. In general, an immunoaffinity column is constructed by covalently coupling the anti-EG-VEGF polypeptide 
antibody to an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium sulfate or 
by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). Likewise, monoclonal 
10 antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or chromatography on immobilized 
Protein A. Partially purified immunoglobulin is covalently attached to a chromatographic resin such as CnBr-activated 
SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody is coupled to the resin, the resin is blocked, and the 
derivative resin is washed according to the manufacturer's instructions. 

Such an immunoaffinity column is utilized in the purification of EG-VEGF polypeptide by preparing a fraction 
15 from cells containing EG-VEGF polypeptide in a soluble form. This preparation is derived by solubilization of the whole 

cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by other methods 
well known in the art. Alternatively, soluble EG-VEGF polypeptide containing a signal sequence may be secreted in 
useful quantity into the medium in which the cells are grown. 

A soluble EG-VEGF polypeptide-containing preparation is passed over the immunoaffinity column, and the 
20 column is washed under conditions that allow the preferential absorbance of EG-VEGF polypeptide [e.g., high ionic 

strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt antibody/EG- 
VEGF polypeptide binding (e.g., a low pH buffer such as approximately pH 2-3, or a high concentration of a chaotrope 
such as urea or thiocyanate ion), and EG-VEGF polypeptide is collected. 



25 EXAMPLE 9 

Drug Screening, 

This invention is particularly useful for screening compounds by using EG-VEGF polypeptides or binding 
fragment thereof in any of a variety of drug screening techniques. The EG-VEGF polypeptide or fragment employed in 
such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located intracellular^. 

30 One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with 
recombinant nucleic acids expressing the EG-VEGF polypeptide or fragment. Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be used for standard 
binding assays. One may measure, for example, the formation of complexes between EG-VEGF polypeptide or a 
fragment and the agent being tested. Alternatively, one can examine the diminution in complex formation between the 

35 EG-VEGF polypeptide and its target cell or target receptors caused by the agent being tested. 
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Thus, the present invention provides methods of screening for drugs or any other agents which can affect a 
EG-VEGF polypeptide-associated disease or disorder. These methods comprise contacting such an agent with an EG- 
VEGF polypeptide or fragment thereof and assaying (I) for the presence of a complex between the agent and the EG- 
VEGF polypeptide or fragment, or (ii) for the presence of a complex between the EG-VEGF polypeptide or fragment and 
5 the cell, by methods well known in the art. In such competitive binding assays, the EG-VEGF polypeptide or fragment 

is typically labeled. After suitable incubation, free EG-VEGF polypeptide or fragment is separated from that present in 
bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular agent to bind to 
EG-VEGF polypeptide or to interfere with the EG-VEGF polypeptide/cell complex. 

Another technique for drug screening provides high throughput screening for compounds having suitable 

10 binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. Briefly 

stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such as plastic 
pins or some other surface. As applied to a EG-VEGF polypeptide, the peptide test compounds are reacted with EG- 
VEGF polypeptide and washed. Bound EG-VEGF polypeptide is detected by methods well known in the art. Purified 
EG-VEGF polypeptide can also be coated directly onto plates for use in the aforementioned drug screening techniques. 

15 In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the solid support. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding EG-VEGF polypeptide specifically compete with a test compound for binding to EG-VEGF 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any peptide 
which shares one or more antigenic determinants with EG-VEGF polypeptide. 

20 The present invention also provides for methods of screening for a bioactive agent that is capable of 

modulating the activity of EG-VEGF. In these method a candidate bioactive agent is added to a sample of EG-VEGF 
and it is determined if an alteration in the biological activity of EG-VEGF results. Such an alteration might be in EG- 
VEGF's ability to stimulate cell proliferation, to induce chemotaxis, to stimulate angiogenesis, to induce cell 
differentiation or to phosphorylate ERK1 and ERK2. 

25 

EXAMPLE 10 
Rational Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of interest 
[i.e., a EG-VEGF polypeptide) or of small molecules with which they interact, e.g., agonists, antagonists, or inhibitors. 
30 Any of these examples can be used to fashion drugs which are more active or stable forms of the EG-VEGF polypeptide 

or which enhance or interfere with the function of the EG-VEGF polypeptide in vivo (c.f., Hodgson, Bio/Technology , 9: 
19-21(1991)). 

In one approach, the three-dimensional structure of the EG-VEGF polypeptide, or of an EG-VEGF 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, by a 
35 combination of the two approaches. Both the shape and charges of the EG-VEGF polypeptide must be ascertained to 



WO 02/00711 



PCT/US01/20116 



elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding the 
structure of the EG-VEGF polypeptide may be gained by modeling based on the structure of homologous proteins. In 
both cases, relevant structural information is used to design analogous EG-VEGF polypeptide-like molecules or to 
identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved 
5 activity or stability as shown by Braxton and Wells, Biochemistry, 31:7796-7801 (1992) or which act as inhibitors, 

agonists, or antagonists of native peptides as shown by Athauda eta/., J. Biochem. , 113:742-746 (1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, and 
then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent drug 
design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic antibodies 
10 (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, the binding site of 

the anti-ids would be expected to be an analog of the original receptor. The anti-id could then be used to identify and 
isolate peptides from banks of chemically or biologically produced peptides. The isolated peptides would then act as 
the pharmacore. 

By virtue of the present invention, sufficient amounts of the EG-VEGF polypeptide may be made available to 
15 perform such analytical studies as X-ray crystallography. In addition, knowledge of the EG-VEGF polypeptide amino 

acid sequence provided herein will provide guidance to those employing computer modeling techniques in place of or in 
addition to x-ray crystallography. 

EXAMPLE 11 

20 Northern Blot Analysis 

To elucidate the expression pattern of EG-VEGF, Northern blot analysis was performed using RNA from a 
wide variety of human tissues. Human RNA blots were hybridized to a 32 P-labelled DNA probe based on the EG-VEGF 
cDNA. Human multiple tissue polyA+RNA array and human polyA+RNA multiple tissue northern blots were 
purchased from Clontech. Other northern blots used included human fetal RNA blot MTN (Clontech) and human adult 

25 RNA blot MTW-II (Clontech). 

Northern blot analysis was performed according to methods well known in the art. For example, cDNA 
probes were prepared using 30-50 ng of the human or mouse cDNA fragments with the Redi-Prime II kit (Amersham), 
using 32 P-dCTP 3000 Ci/mmol (Amersham). Probes were purified on Sephadex G50 spin columns (Pharmacia) and 
hybridization was carried out at 68°C in ExpressHyb hybridization solution (Stratagene). In another example, blots 

30 were incubated with the probes in hybridization buffer (5 X SSPE; 2 X Denhardt's solution; 100 mg/mL denatured 

sheared salmon sperm DNA; 50% formamide; 2% SDS) for 60 hours at 42°C. The blots were washed several times in 
2 X SSC; 0.05% SDS for 1 hour at room temperature, followed by a 30 minute wash in 0.1 X SSC; 0.1% SDS at 
50°C. The blots were developed after overnight exposure by phosphorimager analysis (Fuji). Equivalent RNA loading 
was assessed by hybridization with the control actin probe. 
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EG-VEGF mRNA transcripts were detected. Figure 18 shows that, in several independent experiments, a 
single mRNA species of 1 .4 kb was expressed, in decreasing order of intensity, in ovary, testis, adrenal gland and 
placenta. No expression was detected in any other tissue, with the exception of a very weak signal in the prostate 
after prolonged exposure (Figure 18). Similar findings were obtained by in situ hybridization in human multi-tissue 
5 arrays (data not shown). These findings indicate that steroidogenic endocrine glands are the major site of expression 

of EG-VEGF mRNA. 

EXAMPLE 12 
In Situ Ligand Binding 

10 To identify the cell types that express EG-VEGF, in situ hybridization studies were performed using a series 

of testis and ovary specimens from human and other primates. Several sets of experiments were performed, one set 
using fluorescence and one using labelled probes. 

In the first set of experiments, unfixed, fresh tissue embedded in OCT was sectioned at 10 microns and 
stored at -20°C for 4 days without desiccant. All steps were carried out at room temperature, with the exception of 

15 the fixation, performed at 4°C. Sections were incubated in 35 mM acetic acid (pH 3.5), 3 mM CaCI 2 , 3 mM MgS0 4 , 5 

mM KCI, 1 M NaCI for 4 minutes. The slides were washed 3 times with HEPES buffered saline plus cation buffer (25 
mM Hepes (pH7/2), 150 mM NaCI, 3 mM CaCI 2 , 3 mM MgSO„, 5 mM Kcl, and 32 mM sucrose) prior to blocking with 
1.5% mouse serum in HFJS-C plus protease inhibitors (Boehringer Manneheim) for 20 minutes. Further blocking was 
carried out using the avidin/biotin blocking kit from Vector Laboratories. The initial experiment used liver, adrenal 

20 gland and ovary prepared from adult rat. As described below, other tissues were analyzed. These sections were 

incubated with ligand, ranging from 0 to 1.5 nM EG-VEGF-Fc, for 60 minutes. Displacement was achieved using 
excess EG-VEGF-his tagged protein. Sections were washed with HBS-C with 0.1% BSA 3 times, and then fixed for 10 
minutes in 4% paraformaldehyde. A 1:4000 dilution of secondary antibody, biotinylated goat anti-human Fc (Jackson 
Laboratories) with 1.5% mouse serum in HBS-C, was added to sections for 30 minutes. After washing, sections were 

25 post-fixed for 10 minutes in paraformaldehyde. Slides were washed with TBS (100 mM Tris-HCI (pH8.0), 150 mM 

NaCI) twice and then treated with 3% H 2 0 z in TBS for 10 minutes. Following washes with TBS, sections were 
reacted with Dupont TSA kit components, HRP-streptavidin at 1:1000 and biotinylated tyramide 1:50 in amplification 
diluent. Finally, FITC-conjugated streptavidin (DAKO), 1:1000 dilution in TBS, was reacted for 30 minutes. After 
extensive washing in TBS with 0.05% Tween-20, coverslips were applied using Vectashield mounting media. Images 

30 were obtained using the FITC channel of a fluorescent microscope. In some cases, DAPI (e.g., Molecular Probes), a 

nucleic acid stain was used. DAPI can be excited with a mercury -arc lamp or with the UV lines of the argon-ion laser. 
Additionally, hematoxylin-eosin stains were used according to standard methods previously described in order to show 
the structure of the tissue where expression was demonstrated. 

In chimp and monkey ovaries, the most intense signal was over the granulosa cells, especially those nearest 

35 to the oocyte, in primary follicles. Weaker signal was present over granulosa cells in primordial follicles and in 
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cumulus oophorus. Significant signal was present in the ovarian cortex, especially around small, involuted corpora 
lutea [cyno monkey sample) and in poorly defined clusters of cells at the vascular pole of the ovary. The results are 
shown in Figures 5A-F (all at 55 microns). Figure 5A shows a hematoxylin-eosin stain and Figure 5B shows EG-VEGF 
expression of EG-VEGF using FITC in the ovary of a 2 year old chimp. Figure 5C shows a hematoxylin-eosin stain and 
5 Figure 5D shows EG-VEGF expression using FITC in the ovary of a cyno monkey. Figure 5E shows a hematoxylin- 

eosin stain and Figure 5F shows EG-VEGF expression using FITC in a chimp stromal ovary. 

In human ovaries, there was diffuse moderate expression in cortical stroma. The signal was intense over 
theca externa around corpus hemorrhagicum. Additionally, there was moderate focal positive signal in corpora lutea. 
The results are shown in Figures 6A-D. Figure 6A shows a hematoxylin-eosin stain and Figure 6B shows EG-VEGF 

10 expression using FITC (25 microns). Figure 6C shows a hematoxylin-eosin stain and Figure 6D shows EG-VEGF 

expression using FITC (1 15 microns). 

In situ hybridization studies were also performed on an ovary from a 46 year old woman with uterine 
prolapse. In this case, EG-VEGF was radioactively labeled using standard techniques previously described. The 
autoradiogram showing EG-VEGF expression is shown in Figure 7A. Figure 7B shows a hematoxylin-eosin stain of the 

15 same ovary. 

Additionally, in situ hybridization studies were performed on rat adrenal cortex tissue sections. Figures 8A 
and 8B show EG-VEGF nucleic acid identification using DAPI and Figures 8C and 8D show EG-VEGF protein 
identification using FITC. 

Furthermore, positive in situ hybridization results were found in the following tissues: ovarian carcinoma, 

20 papillary serous; ovarian, normal cortex; ovarian cyst, simple cyst; testis, chronic inflammation; normal ovary; ovarian 

tumor, papillary serous; adrenal adenoma; and normal testis.ln the second set of experiments, 33 P-UTP-labelled RIMA 
probes were generated according to methods known in the art. For example, according to the method described in 
Melton, D. A., et al.. Nucleic Acids Res. , 12:7035-7056 (1984). Sense and antisense probes were synthesized from a 
cDNA fragement corresponding to nucleotides 219-958 of the human EG-VEGF sequence. The in situ hybridization 

25 was carried out according to methods well known in the art. The results of this set of experiments are shown in 

Figure 9. Panels A, C, E, G, I, K, M and 0 are bright-field images, and B, D, F, H, J, L, N and P are the corresponding 
dark-field images. Arrows indicate the location of a blood vessel. 

In the testis, a strong hybridization signal was detected (Figure 9, panels A-E). Figure 9, panels A-E show 
that this signal was restricted to the testosterone producing Leydig cells. It is noteworthy that the endothelium of the 

30 testis has a surprisingly high turnover: as many as 3% of endothelial cells are labeled with BrdU, probably related to 

the intense metabolic activity associated with spermatogenesis and steroidogenesis [Collin and Bergh, Int. J. Androl., 
19:221-228 (1996)]. In this context, Leydig cells are known to be a source of angiogenic and permeability-enhancing 
factors such as VEGF [Collin, supra] , Interestingly, in these experiments the VEGF hybridization signal was 
considerably less intense than the EG-VEGF signal (data not shown). In the ovary, intense EG-VEGF mRNA expression 

35 was localized to the androgen-producing cortical stroma, the cumulus oophorus, theca and granulosa of developing 
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follicles (Figure 9, panels F-P). The strong expression of EG-VEGF mRNA within the theca is coincident with the 
development of a capillary network associated with follicular development and steroid hormone production [Basset, 
Am. J. Anat , 73:251-278 (1943)], and is consistent with a pro-angiogenic role of EG-VEGF. This expression pattern 
is essentially similar in the other primate species examined, cynomologus monkey and chimpanzee (Figure 9, panels G-l 
5 and data not shown). While VEGF has a very low expression in the specialized stroma [Ravindranath et al.. 

Endocrinology, 131:254-260 (1992)], EG-VEGF is instead strongly expressed (Figure 9, panels F, G). Interestingly, 
such stroma becomes intensely hyperplastic and angiogenic, producing excessive amounts of androgens, in the 
polycystic ovary syndrome [Goldziher and Green, J. Clin. Endocrino. Meta. , 22:325-332 (1962)]. The expression 
pattern of EG-VEGF in the cumulus oophorus (Figure 9, panels N-P) is very similar to that of VEGF Ravindranath et al., 

10 supra ; Philips et al.. Endocrinology , 127:965-967 (1990)]. In the corpus luteum (CL), EG-VEGF expression was 

localized to discrete nests of cells (data not shown). While EG-VEGF is highly expressed in the ovarian stroma and 
within developing follicles (Figure 9, panels D-l and data not shown), VEGF is expressed at the highest level in the CL 
[Ravindranath et al., supra ]. Inactivation of VEGF in rodents [Ferrara et al., Mat. Med. , 4:336-340 (1998)] or in 
primates [Ryan et al., Toxicol. Pathol. , 27:78-86 (1999); Fraser et al., Endocrinology , 141:995-1000 (2000)] results in 

15 a dramatic suppression of CL development. However, follicular development proceeds almost unimpaired (unpublished 

observations), indicating that other factors contribute to follicular angiogenesis. It is likely that EG-VEGF contributes 
to such VEGF-independent angiogenesis. Thus, the partially complementary expression patterns of VEGF and EG-VEGF 
indicate that these molecules may function in a coordinated or complementary manner to promote angiogenesis in the 
ovary, and potentially in other tissues that co-express these factors. 

20 

EXAMPLE 13 
125 l EG-VEGF Cell Binding Assays 
125 l ligand binding sudies were performed with EG-VEGF to characterize the receptor distribution on a variety 
of cell types. In one case, EG-VEGF-tagged protein, 500 g, was labeled with 0.5 mCi ,25 I using the lodobead 
25 preparation from Pierce. The labeled protein was purified on a C1 8 Sep Pak in 50% acetonitrile, 0.1% TFA. In a 

second set of experiments, EG-VEGF was iodinated by the lactoperoxidase method using l\la 1Z5 l (Amersham) and the 
labeled protein was subsequently purified by reverse phase chromatography using a prepacked SynChropak RP HPLC 
C4 column (Micra Scientific, Inc.). The specific activity of the molecule ranged from 49 to 70 Ci/ g. Competitive 
binding assays were performed on various cell types including Cos cells. Cells were cultured for 2-3 days in 24-well 
30 dishes prior to binding. 

In one set of experiments, assays were performed using 0.2 to 0.4 nM labeled ligand and unlabeled EG-VEGF- 
his as competitor. The highest concentration of unlabeled EG-VEGF-his used in the assays was 270 nM, and a three- 
fold dilution series for the next 10 concentrations of competitor. bFGF and VEGF were also included as competitors 
(at 500-fold molar excess), with neither ligand displacing the labeled EG-VEGF-his. 
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In another set of experiments, competition assays mere performed by adding unlabeled, tagged EG-VEGF at 
3-fold serial dilutions starting at 270 nM for 12 concentrations in at least duplicate. In other experiments, unrelated 
ligands including bFGF and VEGF at 200-molar excess were added to confirm specific binding. Ligand binding data 
was analyzed using the New Ligand program. 
5 The competition analysis revealed high affinity, specific binding of EG-VEGF to ACE cells with a Kd of 0.9- 

1 .0 nM with an ED 50 = 1 0 nM (Figures 1 0A, 1 QB, and 1 0Cj. A similar high affinity binding was evidenced in MS- 1 cells 
(Figures 1 1 A and B). However, other non-responsive cells, including HUVEC, exhibited a low affinity binding 
component (Kd of > 80 nM) or little significant binding and the apparent non-specific, low affinity binding was only 
displaced at competitor concentrations above 270 nM (Figure 11 C and data not shown). Similarly, 5 other non- 
10 responsive cell types assayed did not display the high affinity binding component. Protein cross-linking experiments 
and membrane protein phosphorylation analyses indicate that the EG-VEGF receptor components are restricted to the 
specific EG-VEGF responsive cells (data not shown). 



EXAMPLE 14 

15 Cell Proliferation Assays 

To determine the spectrum of cell types that respond to EG-VEGF, a panel of endothelial and non-endothelial 
cell types was assayed for proliferative response. Cells were grown in the presence of a range of concentration of Fc- 
tagged and His-tagged EG-VEGF ligand for 5-8 days. The cell types assayed included bovine adrenal cortical capillary 
endothelial cells (ACE), bovine brain capillary endothelial cells (BBC), human umbilical vein endothelial cells (HUVEC), 

20 human dermal microvascular endothelial cells (HMVEC), adult bovine aortic endothelial cells (ABAE), bovine pericytes, 

human aortic vascular smooth muscle cells (HA-VSMC), baby hamster kidney fibroblasts (BHK-21) and human neonatal 
fibroblasts hFb. Cells were plated at a density of 4000 to 6000 cells/ml in 6 or 12 well dishes. At time of plating, no 
ligand, 5 ng/ml bFGF (GIBCO BRL), 5 ng/ml VEGF, or a range of EG-VEGF-Fc or -His tagged protein from 1 nM to 100 
nM was added to the wells. Duplicate or triplicate samples were plated for each cell type. Confluence and 

25 morphology were monitored daily. At day 5 to 7 of the assay, cells were trypsinized in 0.01 % trypsin (Gibco BRL) and 

cell number was assessed using a Coulter counter. Media and other cell culture reagents were obtained from Life 
Technolgies, Inc. HUVEC, HMVEC and human keratinocytes were purchased from Clonetics. Bovine pericytes 
andhuman neonatal fibroblasts were a gift from M. Gerritsen. For the performance of assay see also Aravind and 
Koonin, Cutr. Biol. 8:477-478 (1998). 

30 Several preliminary results are shown in Figure 12A-E which indicate cells per well on the verticle axis. Each 

graph indicates relative proliferation of cells using either a control, bFGF, VEGF, or EG-VEGF in the concentration of 1 
nM, 10 nM or 25 nM. Figure 12A shows the results using pericytes. Figure 12B shows the results using human 
aortic vascular smooth muscle cells (HA-VSMC). Figure 12C shows the results using baby hamster kidney fibroblasts 
(BHK21). Figure 12D shows the results using ACE. Figure 12E shows the results using bovine brain capillary 

35 endothelial cells (BBC). 
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Similar results were obtained in the experiments the results of which are shown in Figure 13. In these 
experiments, basal media served as a negative control (Ct) and bFGF (F) or VEGF (V) served as positive controls. EG- 
VEGF was tested at a range of concentrations. As illustrated in panel a, EG-VEGF stimulated proliferation of ACE cells 
with an ED 50 of 0.2 nM. A maximal effect was observed at approximately 2 nM. The fold increase in cell number was 
5 very similar to that induced by VEGF. The baculovirus-produced his- and Fc-tagged EG-VEGF proteins behaved very 
similarly in all experiments. Consistent with a mitogenic function, EG-VEGF induces in ACE cells a rapid and significant 
phosphorylation of the MAP kinases ERK1 and 2, as well as of other proliferation and survival signaling molecules. 
The mitogenic activity of EG-VEGF was not hlocked by administration of a VEGF soluble receptor (mFlt-IgG) tested at a 
concentration range between 50 and 1000 ng/ml, indicatingthat such effect is no mediated by VEGF release. These 
1 0 findings clearly do not rule out the possibility of reciprocal inductive effects in more complex in vivo systems. 

To determine the spectrum of cell types that proliferate in response to EG-VEGF, we tested a variety of 
other endothelial and non-endothelial cell types. The endothelial cell types tested include human umbilical vein 
(HUVEC), human dermal microvascular (HMVEC), bovine brain capillary (BBC), and adult bovine aortic (ABAE) 
endothelial cells. 

15 As can be seen in Figure 13, panel b, BBC cells showed only about 10% of the proliferative response seen 

with VEGF and the other endothelial cell types failed to show any effect following EG-VEGF administration, at all 
concentrations tested. Thus EG-VEGF is believed to be the first example of an endothelial cell mitogen that has such a 
restricted target cell specificity. Other endothelial cell mitogens such as VEGF and bFGF do not show any significant 
selectivity for various endothelial cell types [Leung et al., Science , 246:1306-1309 (1989)]. 

20 Figure 13, panel c illustrates the finding that EG-VEGF did not elicit any proliferative response in cultures of 

vascular smooth muscle cells, pericytes, fibroblasts or keratinocytes. Thus, EG-VEGF is not only an endothelial- 
specific mitogen, but also one that acts selectively on a defined endothelial cell type. 

In Figure 1 3, panels b and c the proliferation index on the vertical axis is relative to the negative control, 
which is arbitrarily set at 1 . 

25 

EXAMPLE 15 
Cell Migration (Chemotaxis) Assays 
An essential aspect of the angiogenesis cascade is chemotaxis [Zetter, Mature , 285:4143 (1980)]. VEGF or 
bFGF is able to act as a chemoattractant and stimulate endothelial cell migration. Assays for chemotactic activity 
30 (which will identify proteins that induce or prevent chemotaxis) have been previously described, see, for example, 

Current Protocols in Immunology, Ed by Coligan, Kruisbeek, Margulies, Shevach and Strober, Pub. Greene Publishing 
Associates and Wiley-lnterscience, Chapter 6.12: 6.12.1-6.12:28; Taub et al., J. Clin. Invest. 95:1370-1376, 1995; 
Lind et al., APMIS 103:140-146, 1995; Mueller et al., Eur. J. Immunol. 25:1744-1748; Gruber et al., J Immunol. 
152:5860-5867, 1994; Johnston et al., J Immunol., 153:1762-1768, 1994. To define the biological activity of EG- 
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VEGF, we assessed whether ACE cells and several other cell types display a chemptactic response to this molecule in 
a modified Boyden chamber assay. 

Specifically, ACE cells, baboon adrenal endothelial cells (BAEC), MS-1 cells and HUVEC were used for 
migration assays. The MS-1 cell line was from the ATCC. Primary baboon endothelial cells were isolated from 
5 adrenal glands of premature or fetal baboons (gift of R. Clyman, UCSF). The tissue was dissociated essentially as 
described in Mesiano et al., J. Clin. Endocrinol. Metab. 76:968-976, 1993. Briefly, the capsule was removed and the 
remaining tissue was finely chopped into fragments of approximately 2 mm 3 in size with sterile razor blades. The 
fragments were subsequently incubated at 37°C for 30-40 minutes in 0.1% collagenase (Sigma) in 50:50 Ham's 
F10:DMEM media, 10% FCS with the addition of DNase I (Life Technologies, Inc.). Single cell suspensions were 

10 washed, resuspended in PBS with 5% FCS, incubated with 1 g of anti-KDR monoclonal antibody for 10 7 cells, 

washed, and then incubated with goat anti-mouse antibody conjugated to fluoroscein isothiocyanate (Sigma). The 
KDR-positive versus negative populations were sorted using the EPICS ELITE cell sorter (Coulter Corporation). The 
endothelial cell population was maintained in CSC-medium containing 10% serum and growth factors (Cell Systems, 
Kirkland, WA). Falcon 8.0 m filter inserts (Falcon 3097) were coated with 1% gelatin. Cells were cultured prior to 

15 assay as described above. Cells were trypsinized and transferred to EBM (Endothelial basal media, Clonetics) with 

0.1% BSA for the assay. Cells were plated at 1-5x10" per upper chamber and growth factors were placed in the 
lower chamber. The assay was routinely a 16 hour assay at 37 °C. At completion, cells were removed from the 
upper side of the membrane by scraping with a polyurethane swab and then the remaining cells on the bottom of the 
membrane were fixed with methanol. Cells were stained with propidium iodide and counted under low power using 

20 the Image-Pro program. The y-axis in Figures 14A, 14B and Figure 13, panel d represents the migration index and is 
relative to the negative control, which is arbitrarily set at 1. 

Preliminary results are shown in Figures 14A and 14B. Each graph shows relative endothelial cell migration 
in a control, in the presence of VEGF, or in the presence of EG-VEGF at a concentration of .2 nM, .5 nM, 1 nM or 5 
nM. Figure 14B shows the results using ACE. 

25 These observations were reinforced by the results of further experiments shown in Figure 13, panel d. A 

strong and reproducible chemotactic response was elicited in ACE cultures, with a peak response at 0.5 nM EG-VEGF. 
The magnitude of the effect was similar to that induced by VEGF. To extend our results to another primary cell type, 
we purified endothelial cells from baboon adrenal cortex (BAEC) by fluoresence-activated cell sorting, using a 
monoclonal antibody that recognizes the VEGF receptor-2, or KDR. In BAEC cultures, 0.5-5 nM EG-VEGF induced a 

30 chemotactic response of the same magnitude as that induced by VEGF. Furthermore, EG-VEGF induced migration in 

MS-1 endothelial cells to an even greater extent than VEGF. The MS-1 cell line, isolated from microvessels of murine 
endocrine pancreas, retains highly differentiated properties, such as VEGF receptor expression [Arbiser et al., Proc. 
Natl. Acad. Sci. U.S.A., 94:861-866 (1997)]. However, no response was elicited from HUVEC, even though these 
cells displayed a strong response to the VEGF. Therefore, in addition to its mitogenic activity, EG-VEGF also acts as a 

3 5 chemoattractant, but only for a specific endothelial cell type. 
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These findings are in full agreement with the binding studies presented in Example 13, which revealed that 
iodinated EG-VEGF shows high-affinity, specific binding in ACE cells, that could be cross-linked to a membrane protein. 
However, no high affinity binding was detected in non-responsive cells such as HUVEC. 

5 EXAMPLE 16 

Phosphorylation Analysis 

Phosphorylation analysis was carried out according to methods known in the art. Briefly, adrenal cortex- 
derived capillary endothelial (ACE) cells were serum-deprived in 0.1% FCS for 14-16 hours, followed by an additional 
90 minutes in 0.05% BSA. Cells were then treated with 2.5 mM VEGF (to induce maximal response) or 20 nM EG- 
10 VEGF adn washed once in ice-cold PBS containing 0.1 mM sodium orthovanadate. Cells were lysed in 0.5 ml RIPA 

buffer containing 0.1 mM sodium orthovanadate, 5 mM para-nitrophenylphosphate, 10 mM sodum fluoride, 0.5 mM 
okadaic acid adn a protease inhibitor cocktail (Roche MB 1836145). Anti-phospho ERK antiserum was purchased 
from Promega. 

Consistent with a mitogenic function, EG-VEGF induces in ACE cells a rapid and significant phosphorylation of 
15 the MAP kinases ERK1 and ERK2 (Figure 15), as well as of other proliferation and survival signaling molecules (data 

not shown). Specifically, Figure 15 shows that 20 nM EG-VEGF induced a significant and rapid phosphorylation of 
ERK1 and 2. No factor (ct) and VEGF (V) stimulation for 5 minutes served as controls and the time course for EG- 
VEGF stimulation is indicated above the lanes. Total ERK 1 and 2 levels are shown in the lower immunoblot. Further, 
the mitogenic activity of EG-VEGF was not blocked by administration of a VEGF soluble receptor (mFIMgG) tested at a 
20 concentration range between 50 and 1 000 ng/ml, indicating that such effect is not mediated by VEGF release. 

EXAMPLE 17 
Fenestration Assays 

Endothelial cells within the adrenal cortex display a rather unique fenestrated phenotype found in restricted 
25 sites also including other endocrine glands, the choroid plexus, the gastrointestinal tract, and many tumors 

[Simionescu, supra l. Fenestrae are specialized plasma membrane microdomains or windows, approximately 60 nm in 
diameter, that are usually clustered [Palade et al.. Acta Physiol. Scand. Suppl. , 463 :1 1-32 (1979)]. The fenestrae are 
highly permeable to fluid and small solutes and are thought to facilitate large exchange of materials between 
interstitial fluid and plasma, such as that occurring in steroid-producing as well as other endocrine glands like 
30 pancreatic islets. We tested whether EG-VEGF could induce fenestrations in endothelial cells, alone or in combination 

with VEGF. 

ECM was prepared according to a method known in the art, essentially as described in Gospodarowicz et al., 
J. Cell. Biol. 99:947-961, 1984. Briefly, corneal endothelial cells were isolated from steer eyes (Pel Freez, Arkansas) 
and these were expanded in 50:50 Ham's F10:DMEM media supplemented with 15% FCS, penicillin/streptomycin, 
35 fungizone. To prepare the ECM-coated plates, 4 X 10 4 cells were plated per well in 6-well dishes and cultured for 
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approximately 10 days in low glucose DMEM supplemented with 10% FCS, 2.5% dextran (Sigma 4133) and 
penicillin/streptomycin. At the end of 10 days, the cells were quickly lysed in 0.02 M NH 4 0H in water, rinsed several 
times with PBS and stored at 4°C in PBS with antibiotics. ACE or MS-1 cells were plated at a density of 1-2 X 10 5 
and grown to confluence. No addition, 2.5 nM VEGF, 10 nM EG-VEGF, or 2.5 nM VEGF plus 10 nM EG-VEGF were 
5 added to individual wells, in at least duplicate. The fenestration assays were replicated 3 times. Cells were rinsed 

with PBS and fixed for 2 hr in 2% formaldehyde, 2.5% glutaraldehyde in 0.1 M cacodylate buffer. After washing, the 
samples were post-fixed in aqueous 1% osmium for 2 hr, washed in water, dehydrated through graded ethanol and 
propylene oxide, and embedded in EPONATE 12 (Ted Pella, Inc., Redding CA). Ultrathin sections were cut on a 
Reichert Ultracut E microtome, counter-stained with uranyl acetate and lead citrate, examined in a Philips CM12 

10 transmission electron microscope at 80 kV and images were captured with a GATAN Retractable Multiscan digital 

camera. At least 100 cell profiles were examined for each sample. 

Figure 13 shows representative electron micrographs from these experiments. Panel e shows untreated ACE 
cells. Figure 13, panel f shows ACE cells treated with VEGF. Figure 13, panel g shows ACE cells treated with EG- 
VEGF. Arrowheads indicate the location of fenestrae and magnification is indicated. 

1 5 Notably, so far only VEGF has been reported to display the ability to induce fenestrations, in vivo and in vitro 

[Roberts and Palade, Cancer Res. , 57:765-772 (1997); Esser et al„ J. Cell. Biol. . 140:947-959 (1998)]. ACE or MS-1 
cells were grown to confluence on extracellular matrix produced by bovine corneal endothelial cells and then treated 
with ligand for 24-72 hours. In agreement with previous studies [Esser et al., supra] , VEGF induced fenestrations in 
4.33 ± 1.53 % of ACE cell profiles (Figure 13, panel f). The MS-1 cultures acquired a similar number of fenestrations 

20 in response to VEGF (not shown). The effect of EG-VEGF was very similar to that of VEGF, in both cell types (Figure 

13, panel g for ACE). A combination of the two factors produced an additive or co-operative response, inducing 
fenestrae in 1 1 ± 1% of the ACE cell profiles. No fenestrations were observed in the absence of VEGF or EG-VEGF. 
This finding supports the hypothesis that these factors may cooperate in vivo, in settings such as the adrenal cortex 
or the ovary, to induce the fenestrated phenotype of the resident endothelial cells. 

25 

EXAMPLE 18 
Hypoxic Regulation of EG-VEGF 
A. Taqman Analysis of EG-VEGF Expression 

Hypoxia is a key inducer of angiogenesis in both physiological and pathological conditions. By virtue of 
30 activating hypoxia-inducible factor (HIF)1, low oxygen tension is known to induce expression of VEGF, in addition to 
erythropoietin (Epo) and certain glycolytic enzymes, through cis-acting regulatory elements Semenza, J. Appl. Physiol. 
88:1474-1480 (2000). Therefore, we sought to determine whether hypoxia regulates EG-VEGF mRNA expression in 
endothelial cells. 

For expression analysis, RNA isolates from replicate, matched samples of normoxic-treated versus hypoxia- 
35 exposed SW13 and H295R cells (both from the ATCC) were prepared using the Rneasy kit (Qiagen) as described by 
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the manufacturer. For real-time quantitative RT-PCR, 50 ng of total RNA was assayed in triplicate with the Perkin 
Elmer Taqman kit reagents and an ABI prism 7700 Sequence Detector. Oligos and probes used are as follows: 

forward EG-VEGF PCR primer 5'-CCGGCAGCCACAAGGTC-3' (SEQ ID N0:9) 
5 reverse EG-VEGF PCR primer 5'-TGGGCAAGCAAGGACAGG-3' (SEQ ID N0:1 0) 

EG-VEGF probe 5'-CCTTCTTCAGGAAACGCAAGCACCAC-3' (SEQ ID N0:1 1) 

forward VEGF PCR primer 5'-AATGACGAGGGCCTGGAGT-3' (SEQ ID N0:12) 

reverse VEGF PCR primer 5'-TTGATCCGcATAATCTGCATG-3' (SEQ ID NO: 13) 

VEGF probe 5'-TGTGCCCACTGAGGAGTCCAACATCA-3' (SEQ ID NO: 14) 
10 As a negative control, expression analysis of -actin was similarity performed as described above with 

primers and probe supplied by the manufacturer (Perkin-Elmer). 

Figure 17A, panel a illustrates the finding that exposure of the human adrenal carcinoma cell lines SW13 and 

H295R to hypoxic conditions ("2% oxygen) resulted in a 275+ 15% and 210± 12% increase in EG-VEGF mRNA levels 

above normoxia, respectively, while the VEGF mRNA increased 352±30% and 266+14%, respectively. A search of 
15 the EG-VEGF promoter sequence for the core HIF-1 binding site revealed a putative element within the first 2450 

nucleotides of the transcription start site, based on a consensus sequence (5TACGTGCGGC-3', bolded text 

represents invariable sequence). 

B. HRE Luciferase Activity Analysis 
20 To determine whether the hypoxic regulation of the EG-VEGF gene was mediated by HIF-1, expression of a 

luciferase reporter gene under the control of the putative EG-VEGF element was analyzed. The luciferase reporter 
constructs were generated by cloning compatible, annealed oligos into the Bglll site of the pGL3-Promoter vector 
(Promega): 

HRE_consensus(£/t?0) sense 5'-AGGCCCTACGTGCGGCCTCACACAGCCTGTTCTGA-3' (SEQ ID M0:15) 
25 HRE_mutant(£/w) sense 5'-AGGCCCTAATTGCGGCCTCACACAGCCTGTTCTGA-3' (SEQ ID N0:16) 

HRE EG VEGF sense 5'-GCTAAGGACGTGCTATTCATGGGGTGCAGGAAGAT-3' (SEQ ID NO: 17) 
HREJG-VEGF mutant sense 5'-GCTAAGGAATTGCTATTCATGGGGTGCAGGAAGAT-3' (SEQ ID N0:18) 

Transient transfections of the luciferase reporter constructs into HeLa cells were performed in 6-well dishes 
using Effectene transfection reagent (Qiagen). For each well, 0.75 g of reporter along with 0.25 g of pRL-SV40 
30 control vector was transfected. Triplicate, paired transfections were used in parallel normoxic versus hypoxic 

incubations, dishes being exposed to hypoxia for 18-24 hours commencing 24 hours after transfection. Cells were 
lysed and assayed using the dual-luciferase reporter assay system (Promega). Luciferase activity is normalized to 
control vector values, and a representative experiment is shown in Figure 17B error bars representing standard 
deviation. _ 
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Following an 18-24 hour incubation in hypoxic conditions, a lucif erase reporter construct containing the 
putative EG-VEGF element conferred a 3.3+0.8-fold increase above normoxic conditions. This level is comparable to 
that conferred by the hypoxia-response element (HRE) Epo consensus, 3.4± 1 .2-fold (Figure 1 7, panel b). Mutating the 
core sequence of either consensus or the putative EG-VEGF HRE abolished the response to hypoxia, verifying the 
5 specificity of the response. While we cannot rule out additional mechanisms, these findings indicate that in all 

likelihood HIF-1 is a key mediator of the hypoxic regulation of the EG-VEGF gene. 

Thus, it is striking that in spite of the lack of sequence homology, the EG-VEGF and VEGF genes, not only 
have similar effects in responsive endothelial cells, for example hypoxia-inducible expression, but may also be 
regulated through common mechanisms. 

10 

Example 19 
Heparin Binding Assay 

Angiogenic factors, including bFGF and VEGF, interact with extracellular matrix components, such as heparin 
sulfate proteoglycans. The interaction between angiogenic factors with extracellular matrix components is suggested 
15 to regulate the bioavailability and activity of these molecules (Klagsburn, Semin Cancer Biol. 3:81-7 (1992)). Thus, 

we tested whether EG-VEGF binds to heparin. 

15 g of untagged protein was applied to a Hi-Trap heparin-sepharose column (Pharmacia) in 10 mM Tris, pH 
7 A 0.1 M NaCI. The column was eluted with a linear gradient of NaCI (0.1-2.0 M). Human VEGF 1B6 was used as a 
reference. 

20 When applied to a heparin sepharose column, EG-VEGF eluted in the presence of "0.5M NaCI, while VEGF 1B5 

eluted at "0.7 M NaCI under the same chromatographic conditions. This finding indicates that EG-VEGF is a heparin- 
binding growth factor which may be sequestered in the extracellular compartment in vivo. 

Example 20 

25 Selectivity of In vivo Angiogenic Effects 

A. Rat Corneal Pocket Anaionenesis Assay 

The in vivo activity of EG-VEGF was examined an adult rat corneal pocket assay, as previously described 
(Gille etal., J. Biol. Cfiem. 276:3222-3230 (2001)). Hydron coated pellets containing, 200 ng VEGF 165 or 500 ng EG- 
VEGF, plus methylcellulose and aluminum sucralfate were inserted into the base of the cornea pocket (n=6). Control 
30 pellets contained 500 ng CD4-lgG. At day 6, animals were euthanized and injected with high molecular weight FITC- 

dextran to allow for visualization of the vasculature. Measurements of neovascular areas in corneal whole mounts 
were performed using computer-assisted image analysis (Image-Pro Plus). 

In the rat corneal assay, purified EG-VEGF failed to show a significant response in all eyes tested, wherease 
VEGF induced the expected angiogenic effects (Figure 19, panels a-c).. Similar results are observed with the rabbit 
35 cornea. Note the strong angiogenic response induced by VEGF protein, while EG-VEGF has essentially no effect. 
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B. Angiogenic Effects of Site-Directed 'Adenoviral Injections of EG-VEGF 

To further study the in vivo activity of EG-VEGF, the effects of EG-VEGF delivery in a local and sustained 
fashion was examined. To achieve EG-VEGF delivery in a local and sustained fashion, adenovirus (Av) vectors were 
5 generated and injected into various sites in athymic nude rats or mice. Recombinant Av vectors expressing lacZ or 

VEGF served as controls. 

Athymic nude rats were anesthetized using isofluorane, and a 2-2.5 cm incision was made on the left dorsal 
area. The ovary was lifted and secured using nontraumatic forceps. Doses of 10 s or 5x1 0 8 pfu in 5-10 I of saline 
were injected via a gas-tight Hamilton syringe fitted with a 31 G needle (Hamilton). All work was performed in a 

10 hiosafety cabinet with BSL2 practices in place. For skeletal muscle (left gastrocnemius) or subcutaneous ear 

injections, athymic nude mice or rats were anesthetized with isofluorane, the area was cleaned, and doses of 5x10° 
pfu of each virus preparation in a 50 I volume was injected per site. Animals in all Av studies were euthanized one 
week after Av administration. At necropsy, tissues were dissected and frozen or fixed for histological analysis. 
When injected in the skeletal muscle of nude rats, the effects of lacZ and EG-VEGF Av were essentially 

1 5 indistinguishable, with minimal inflammatory infiltrate and absence of angiogenesis, and with abundant mitotic 

endothelial cells, edema and granulation tissue (Figure 19, panels d-f). Very similar results were obtained following 
injection of the same Av vectors in the mouse ear (Figure 19, panels g and h ). Again, VEGF Av resulted in a strong 
angiogenic response, which was absent in the animals injected with EG-VEGF Av. However, intra-ovarian delivery of 
either EG-VEGF or VEGF Av resulted within a week in a dramatic enlargement of the injected ovaries, with grossly 

20 visible blood vessels as well as hemorrhagic areas (Figure 1 9, panel i) The ovarian wet weights in a representative 

experiment in nude rats were 33+3 mg in the LacZ group, 489+3D mg in the EG-VEGF group and 191 ±91 mg in the 
VEGF group (n=4). The weight of noninjected ovaries (37±7 mg) was not different from the LacZ group. Although 
there was a trend toward larger mass in the EG-VEGF, compared to the VEGF group, the histological picture was very 
similar. In both cases, large fluid- or blood-filled cystic areas that disrupted the ovarian architecture were present 

25 (Figure 1 9, panels j-n). High power examination revealed intense angiogenesis at the periphery of the cysts (Figure 1 9, 

panels m-n). The morphology of the LacZ ovaries was normal. Thus, VEGF and EG-VEGF are able to trigger the same 
chain of events in a responsive tissue, including the ability to induce fluid extravasation, in addition to angiogenesis. 
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Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 10801 University 
5 Blvd., Manassas, VA 20 1 1 0-2209, USA (ATCC): 

Material ATCC Dep. No. Deposit Date 

DNA6062M516 203091 August 4, 1998 

10 This deposit was made under the provisions of the Budapest Treaty on the International Recognition of the 

Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest Treaty). 
This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The deposit will be 
made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement between Genentech, 
Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of the culture of the deposit to 

15 the public upon issuance of the pertinent U.S. patent or upon laying open to the public of any U.S. or foreign patent 

application, whichever comes first, and assures availability of the progeny to one determined by the U.S. 
Commissioner of Patents and Trademarks to be entitled thereto according to 35 USC 122 and the Commissioner's 
rules pursuant thereto (including 37 CFR 1.14 with particular reference to 886 OG 638). 

The assignee of the present application has agreed that if a culture of the materials on deposit should die or 

20 be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on notification 
with another of the same. Availability of the deposited material is not to be construed as a license to practice the 
invention in contravention of the rights granted under the authority of any government in accordance with its patent 
laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice 
25 the invention. The present invention is not to be limited in scope by the construct deposited, since the deposited 

embodiment is intended as a single illustration of certain aspects of the invention and any constructs that are 
functionally equivalent are within the scope of this invention. The deposit of material herein does not constitute an 
admission that the written description herein contained is inadequate to enable the practice of any aspect of the 
invention, including the best mode thereof, nor is it to be construed as limiting the scope of the claims to the specific 
30 illustrations that it represents. Indeed, various modifications of the invention in addition to those shown and described 

herein will become apparent to those skilled in the art from the foregoing description and fall within the scope of the 
appended claims. 
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WHAT IS CLAIMED IS : 

1. A composition of matter comprising (a) an EG-VEGF polypeptide, (b) an agonist of an EG-VEGF 
polypeptide, or (c) an antagonist of an EG-VEGF polypeptide, in admixture with a pharmaceutical^ acceptable carrier. 

2. The composition of claim 1 wherein said EG-VEGF polypeptide is a native sequence EG-VEGF. 
5 3. The composition of claim 2 wherein said native sequence EG-VEGF is human. 

4. The composition of claim 1 wherein said agonist or antagonist is an anti-EG-VEGF-antibody, a small 
molecule or an antisense molecule. 

5. The composition of claim 1 further comprising a vascular endothelial cell growth factor (VEGF), or 
an agonist or antagonist thereof. 

10 6. An article of manufacture, comprising: 

a container; 

a label on the container; and 

a composition comprising an active agent contained within the container, 

wherein said active agent is selected from the group consisting of (a) an EG-VEGF polypeptide, (b) an agonist 
15 of an EG-VEGF polypeptide, and (c) an antagonist of an EG-VEGF polypeptide; and the label on the container indicates 

that the composition is effective for treating of a condition that is associated with a hormone producing endothelial 
tissue. 

7. The article of manufacture of claim 6 wherein said condition is associated with steroidogenic 
endothelial cells within an endocrine gland. 
20 8. The article of manufacture of claim 6 wherein said tissue is ovarian, testicular, cervical, adrenal, 

placental or prostate tissue. 

9. The article of manufacture of claim 6 wherein said condition is infertility, polycystic ovary 
syndrome, or cancer. 

1 0. A method for identifying a compound that binds to EG-VEGF comprising: 
25 a) contacting a candidate compound with EG-VEGF; and 

b) determining whether said candidate compound binds to said EG-VEGF. 

11. The method of claim 10 wherein said EG-VEGF is a native sequence EG-VEGF, or an amino acid 
sequence variant thereof. 

12. The method of claim 11 wherein said amino acid sequence variant has at least about 85% 
30 sequence identity to the sequence of amino acid residues from about 1 or about 20 to about 1 05, inclusive, of Figure 

2 (SEQID NO: 2). 

13. The method of claim 12 wherein said amino acid sequence variant is a conservative substitution 

variant. 

1 4. The method of claim 1 0 wherein said EG-VEGF is present as a fusion protein. 
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15. The method of claim 10 wherein the ability of said candidate compound to compete with a 
molecule known to bind EG-VEGF is measured. 

1 6. The method of claim 1 5 wherein said candidate compound is contacted with a whole cell or a cell 
membrane fraction expressing the coding sequence of EG-VEGF. 

5 17. A method for identifying a compound that modulates a biological activity of EG-VEGF, comprising 

the steps of: 

a) contacting a candidate compound with EG-VEGF; and 

b) determining an alteration in said biological activity of EG-VEGF. 

1 8. The method of claim 1 7 wherein said compound inhibits a biological activity 
10 of said EG-VEGF. 

1 9. The method of claim 1 7 wherein said compound enhances a biological activity of said EG-VEGF. 

20. The method of claim 1 7 wherein said biological activity is the ability to induce phosphorylation of a 
kinase involved in cell proliferation or survival. 

21 . The method of claim 20 wherein said kinase is a MAP kinase. 

15 22. The method of claim 21 wherein said MAP kinase is ERK1 or ERK2. 

23. The method of claim 17 wherein said biological activity is cell proliferation activity, the induction 
of chemotaxis, angiogenesis, the induction of cell differentiation, or the induction of endothelial cell proliferation. 

24. The method of claim 17 wherein said candidate compound is contacted with a whole cell or a cell 
membrane fraction expressing the coding sequence of EG-VEGF. 

20 25. The method of claim 24 wherein said ceil is a recombinant host cell engineered to express said EG- 

VEGF. 

26. The method of claim 1 7 wherein said candidate compound is contacted with an isolated EG-VEGF. 

27. The method of claim 26 wherein said EG-VEGF is immobilized on a solid support. 

28. A compound identified by the method of any one of claims 1 7 to 27. 

25 29. A method of identifying a receptor for EG-VEGF, said method comprising combining EG-VEGF with a 

composition comprising cell membrane material wherein said EG-VEGF complexes with a receptor on said cell 

membrane material, and identifying said receptor as a EG-VEGF receptor. 

30. The method of Claim 42 wherein EG-VEGF binds tD said receptor, and said method further includes 

a step of crosslinking said EG-VEGF and said receptor. 
30 31. A method of inducing cell proliferation, comprising contacting said cells with EG-VEGF in an amount 

effective to induce proliferation of said cells. 

32. The method of claim 31 wherein said cells are endothelial cells. 

33. The method of claim 32 wherein said endothelial cells are steroidogenic endothelial cells. 

34. The method of Claim 33 further comprising contacting said cells with VEGF. 
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35. A method of inducing cell proliferation, said method comprising introducing a nucleic acid encoding 
EG-VEGF into said cells in an amount effective to induce cell proliferation. 

36. The method of claim 35 wherein said cells are endothelial cells. 

37. The method of claim 36 wherein said endothelial cells are steroidogenic endothelial cells. 

5 38. The method of Claim 36 further comprising introducing a nucleic acid encoding VEGF to said cells. 

39. A method of inducing chemotaxis in cells, comprising contacting said cells with EG-VEGF in an 
amount effective to induce chemotaxis. 

40. The method of claim 39 wherein said cells are endothelial cells. 

41 . The method of claim 40 wherein said cells are steroidogenic endothelial cells. 
1 0 42. The method of Claim 40 further comprising contacting said cells with VEGF. 

43. A method of inducing chemotaxis in cells, said method comprising introducing a nucleic acid 
encoding EG-VEGF into said cells in an amount effective to induce chemotaxis. 

44. The method of claim 43 wherein said cells are endothelial cells. 

45. The method of claim 44 wherein said cells are steroidogenic endothelial cells. 

1 5 46. The method of Claim 45 further comprising introducing a nucleic acid encoding VEGF to said cells. 

47. A method of treating an individual for a condition associated with hormone producing tissue, said 
method comprising administering to said individual a composition comprising EG-VEGF or an agonist or antagonist 
thereof, or nucleic acid encoding EG-VEGF or an agonist or antagonist thereof, in an amount effective to treat said 
condition. 

20 48. The method of Claim 47 further comprising administering VEGF or an agonist or antagonist thereof, 

or nucleic acid encoding VEGF or an agonist or antagonist thereof to said individual. 

49. A method of inducing angiogenesis in hormone producing tissue, comprising contacting said tissue 

with EG-VEGF or an agonist thereof, or introducing into said tissue nucleic acid encoding EG-VEGF or an agonist 

thereof, in an amount effective to induce angiogenesis. 
25 50. The method of Claim 49 further comprising contacting said tissue with VEGF or an agonist thereof, 

or introducing into said tissue nucleic acid encoding VEGF or an agonist thereof. 

51. A method of inducing cell differentiation in cells, comprising contacting said cells with EG-VEGF or 

an agonist thereof, or introducing into said cells nucleic acid encoding EG-VEGF or an agonist thereof, in an amount 

effective to induce cell differentiation. 
30 52. The method of Claim 5 1 further comprising contacting said cells with VEGF or an agonist thereof. 

53. A method of inducing fenestration in cells, contacting said cells with EG-VEGF or an agonist 
thereof, or introducing into said cells nucleic acid encoding EG-VEGF or an agonist thereof, in an amount effective to 
induce fenestration. 

54. The method of Claim 53 wherein said cells are endothelial cells. 

35 55. The method of claim 54 wherein said cells are steroidogenic endothelial cells. 
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56. The method of Claim 54 further comprising contacting said cells with VEGF or an agonist thereof, 
or introducing into said cells nucleic acid encoding VEGF or an agonist thereof. 

57. A method of inhibiting endothelial cell proliferation, comprising contacting said cells with an EG- 
VEGF antagonist in an amount effective to inhibit cell proliferation. 

5 58. A method of inhibiting chemotaxis in endothelial cells, comprising contacting said cells with an EG- 

VEGF antagonist in an amount effective to inhibit chemotaxis. 

59. A method of inhibiting angiogenesis in hormone producing tissue, comprising contacting said tissue 
with an EG-VEGF antagonist in an amount effective to inhibit angiogenesis. 

60. A method of inhibiting differentiation of cells, comprising contacting said cells with an EG-VEGF 
1 0 antagonist in an amount effective to inhibit cell differentiation. 

61. A method of regulating fertility in an individual, comprising administering an EG-VEGF antagonist to 
said individual in an amount effective to regulate fertility. 

62. The method of Claim 61 wherein said fertility is regulated by inhibiting follicle maturation. 

63. The method of Claim 61 wherein said fertility is regulated by inhibiting ovulation. 

15 64. The method of Claim 61 wherein said individual has or is at risk of having polycystic ovary 

syndrome and fertility is regulated so as to maintain fertility. 

65. A method of treating cancer in cells responsive to EG-VEGF in an individual, said method comprising 
administering an EG-VEGF antagonist to said individual in an amount effective to treat cancer. 

66. A method of treating cancer of the reproductive organs in an individual, comprising administering 
20 an EG-VEGF antagonist to said individual in an amount effective to inhibit cancer. 

67. The method of Claim 66 wherein said cancer is selected from the group consisting of ovarian 
cancer, testicular cancer, prostate cancer, and uterine cancer. 

68. The method of claim 67 wherein said cancer is hormone-dependent cancer. 

69. The method of claim 68 wherein said cancer is androgen-dependent cancer. 

25 70. The method of claim 69 wherein said cancer is androgen-dependent prostate cancer. 

71. A method of treating an ovarian cyst in an individual, comprising administering an EG-VEGF 
antagonist to said individual in an amount effective to inhibit growth of a cyst. 

72. A method of regulating permeability of an endothelial cell comprising contacting said endothelial 
cell with an effective amount of EG-VEGF. 

30 73. The method of claim 72 further comprising contacting said endothelial cell with VEGF. 

74. The method of claim 72 wherein said cell is an endothelial cell of the adrenal cortex. 

75. The method of claim 72 wherein said cell is an endothelial cell of the ovary. 
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FIGURE 1 

TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTA 

AGCAGGCAGTGTTTTGCCTTCACCCCAAGTGACCATGAGAGGTGCCACGCGAGTCTC 

AATCATGCTCCTCCTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGA 

GCGGGATGTCCAGTGTGGGGCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGG 

GCTGCGGATGTGCACCCCGCTGGGGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCC 

ACAAGGTCCCCTTCTTCAGGAAACGCAAGCACCACACCTGTCCTTGCTTGCCCAACC 

TGCTGTGCTCCAGGTTCCCGGACGGCAGGTACCGCTGCTCCATGGACTTGAAGAACA 

TCAATTTT TAG GCGCTTGCCTGGTCTCAGGATACCCACCATCCTTTTCCTGAGCACAG 

CCTGGATTTTTATTTCTGCCATGAAACCCAGCTCCCATGACTCTCCCAGTCCCTACAC 

TGACTACCCTGATCTCTCTTGTCTAGTACGCACATATGCACACAGGCAGACATACCT 

CCCATCATGACATGGTCCCCAGGCTGGCCTGAGGATGTCACAGCTTGAGGCTGTGGT 

GTGAAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTAAA 

TGGCAGAAAGGACATTCCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCC 

TGCCCCTCTCCCCACATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTC 

TTGGGTGCATTGCTCAGAGTCCCAGGTCCTGGCCTGACCCTCAGGCCCTTCACGTGA 

GGTCTGTGAGGACCAATTTGTGGGTAGTTCATCTTCCCTCGATTGGTTAACTCCTTAG 

TTTCAGACCACAGACTCAAGATTGGCTCTTCCCAGAGGGCAGCAGACAGTCACCCCA 

AGGCAGGTGTAGGGAGCCCAGGGAGGCCAATCAGCCCCCTGAAGACTCTGGTCCCA 

GTCAGCCTGTGGCTTGTGGCCTGTGACCTGTGACCTTCTGCCAGAATTGTCATGCCTC 

TGAGGCCCCCTCTTACCACACTTTACCAGTTAACCACTGAAGCCCCCAATTCCCACA 

GCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAATCTAATCTGATATTGACATATTA 

GAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCAAGGATCAGCCCTGAGAGC 

AGGTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGGTGGGAGCAAGGG 

ACAGGGAGCAGGGCAGGGGCTGAAAGGGGCACTGATTCAGACCAGGGAGGCAACT 

ACACACCAACATGCTGGCTTTAGAATAAAAGCACCAACTGAAAAAA 
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FIGURE 2 

MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEE 
C 

HPGSHKWFFRKRKmTCPCLPNLLCSRFPDGRYRCSMDLKNINP 



Important features: 

Signal peptide: 
1-19 



N-myristoylation sites: 
33 
35 
46 
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FIGURE 3 A 

PRO XXXXXXXXXXXXXXX (Length = 1 5 amino acids) 

Comparison Protein XXXXXYYYYYYY (Length = 12 amino acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences 
as determined by ALIGN-2) divided by (the total number of amino acid residues of the PRO 
polypeptide) = 

5 divided by 15 = 33.3% 
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FIGURE 3B 



PRO 



Comparison Protein 



XXXXXXXXXX (Length =10 amino acids) 

XXXXXYYYYYYZZYZ (Length = 15 amino acids) 



% amino acid sequence identity 

(the number of identically matching amino acid residues between the two polypeptide sequences 
as determined by ALIGN-2) divided by (the total number of amino acid residues of the PRO 
polypeptide) = 

5 divided by 10 = 50% 
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FIGURE 3C 



PRO-DNA 



MWNNNNNNNNNNN 



(Length = 14 nucleotides) 
(Length =16 nucleotides) 



Comparison DNA 



NNNNNNLLLLLLLLLL 



% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as 
determined by ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic 
acid sequence) = 

6 divided by 14 = 42.9% 
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FIGURE 3D 



PRO-DNA 



NNNNNNN]WNNN 



(Length = 12 nucleotides) 

(Length = 9 nucleotides) 



Comparison DNA 



NNNNLLLVV 



% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as 
determined by ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic 
acid sequence) = 

4 divided by 12 = 33.3% 
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FIGURE 4 

TGGCTCCCCAGCTTGCCAGGCACAAGGCTGAGCTGGAGGAAGCGAGANGCATCTAA 
GCAG 

GCAGTGTTTTGCCTTCACCCCAAGTGACCATGAGAGGTGCCACGCGAGTCTCAATCA 
TGC 

TCCTCCTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATG 
TCC 

AGTGTGGGGCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGT 
GCA 

CCCCGCTGGGGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTC 
TTCA 

GGAAACGCAAGCACCACACCTGTCTTGTTGCCCAACCTGCTGTGCTCCAGTTCCGGA 
CGG 

CAGTACGCTGCTCA 
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FIGURE 5A 




FIGURE 5B 
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FIGURE 5C 
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FIGURE 5E 
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FIGURE 6 A 
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FIGURE 6B 
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FIGURE 6C 
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FIGURE 7 



FIGURE 7 
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FIGURE 9 
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FIGURE 10 A 
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FIGURE 10 C 
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FIGURE 11 C 
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FIGURE 12 A 
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FIGURE 12 B 
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FIGURE 12 C 
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FIGURE 12 D 
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FIGURE 12 E 
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FIGURE 14 A 
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GTGATCACAGGGGCCTGTGAGCGGGATGTCCAGTGTGGGGCAGGCACCTG 200 

VITGACERDVQCGAGTC 
CTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGCACCCCGCTGG 

CAISLWLRGLRMCTPL 
GGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTC 



G R 



G E E C H 



G S H K V 



F F 



AGGAAACGCAAGCACCACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTC 
RKRKHHTCPCLPNLLCS 
CAGGTTCCCGGACGGCAGGTACCGCTGCTCCATGGACTTGAAGAACATCA 400 

RFPDGRYRCSMDLKNI 
ATTTTTAGGCGCTTGCCTGGTCTCAGGATACCCACCATCCTTTTCCTGAG 
N F * 

CACAGCCTGGATTTTTATTTCTGCCATGAAACCCAGCTCCCATGACTCTC 



GCACACAGGCAGACATACCTCCCATCATGACATGGTCCCCAGGCTGGCCT 600 

GAGGATGTCACAGCTTGAGGCTGTGGTGTGAAAGGTGGCCAGCCTGGTTC 

TCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTAAATGGCAGAAAGGACATT 

CCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCCTGCCCCTC 

TCCCCACATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTC 800 

TTGGGTGCATTGCTCAGAGTCCCAGGTCCTGGCCTGACCCTCAGGCCCTT 

CACGTGAGGTCTGTGAGGACCAATTTGTGGGTAGTTCATCTTCCCTCGAT 

TGGTTAACTCCTTAGTTTCAGACCACAGACTCAAGATTGGCTCTTCCCAG 

AGGGCAGCAGACAGTCACCCCAAGGCAGGTGTAGGGAGCCCAGGGAGGCC 1 000 

AATCAGCCCCCTGAAGACTCTGGTCCCAGTCAGCCTGTGGCTTGTGGCCT 

GTGACCTGTGACCTTCTGCCAGAATTGTCATGCCTCTGAGGCCCCCTCTT 

ACCACACTTTACCAGTTAACCACTGAAGCCCCCAATTCCCACAGCTTTTC 

CATTAAAATGCAAATGGTGGTGGTTCAATCTAATCTGATATTGACATATT 1 200 

AGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCAAGGATCAGCC 

CTGAGAGCAGGTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGG 

GGTGGGAGCAAGGGACAGGGAGCAGGGCAGGGGCTGAAAGGGGCACTGAT 

TCAGACCAGGGAGGCAACTACACACCAACATGCTGGCTTTAGAATAAAAG 1 400 

CACCAACTGAAAAAA 



MRGATRVSIMLLLVTVSDCk viTGACRRrlv^AhTrr^ 
MLLLLLLLPPLLLPRAGDAWITGAC{ra]l S 3CG 3 ^TCAVsL 
WI TGACERq i JjCGMGTCCAVSLt 
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FIGURE 20 a 



* C-C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

* match with stop is _M; stop-stop = 0; J (joker) matcl 
*/ 

?define M -8 /* value of a mate 



int 

/* A 

/*A*/ 

l*BV 

l*C*l 

l*DV 

I*E*/ 

1*G*/ 
/*H*/ 
1*1*/ 
1*1*1 
/*K*/ 
l*h*/ 
i*M*/ 
/*N*/ 
1*0*1 
l*P*/ 

I* Q-*/ 

/*R*/ 
/*S*/ 
1*1*1 
/*U*/ 

/*v*/ 
/* w*/ 

I* XV 
I*YV 
I*ZV 

}; 



2,0,-2,0,0,-4,1,-1,-1,0,-1,-2,-1, 
{0,3,-4,3,2,-5,0,1,-2,0,0,-3,- 
{-2,-4,15,-5,-5,-4,-3,-3,-2,0,-5,-6,-5,- 

0, 3,-5, 4, 3,-6, 1,1,-2,0, 0,-4,-3, 2,J 

1,0,-3,1,0,-5,5,-2,-3,0,-2,-4,-3, 
{-1,1,-3,1,1,-2,-2, 6,-2, < 
{-1,-2,-2,-2,-2,1,-3,-2, 5,0, 

0,0,0,0,0,0,0,0,0,0,0 
{-1,0,-5,0,0,-5,-2, 0,-2,0, 
{-2,-3,-6,-4,-3,2,-4,-2,2,0,-3,6, 
{-1,-2,-5,-3,-2,0,-3,-2,2,0, 0,4,6,- 
{0,2,-4,2,1,-4,0,2,-2,0,1,-3,- 



PQRSTUVWXYZ*/ 
1,0,-2,1,1,0,0,-6,0,-3,0}, 
,-1,1,0,0,0,0,-2,-5,0,-3,1}, 

,-3,-5,-4,0,-2,0,-2,-8,0,0,-5}, 
,-1,2,-1,0,0,0,-2,-7,0,-4,2}, 



1,0,0,0,0,-3,1,-1,-1,0,0,-3,-2, 
1,0,-2,0,0,-3,0,-1,0,0,0,-1,-1, 
0, 0,0,0,0,0, 0,0,0,0,0, 0, 0," 

-6'-5,-8,-7,-7, 0,-7,-iX 0,-3,-2, 
0,1,-5,2,3,-5,0,2,-2,0,0,-2,-1, 



M.-1, 2,-1, 0,0, 0,-2,-7, 0,-4, 3}, 
M,-5,-5,-4,-3,-3, 0,-1, 0,0, 7,-5}, 
,-1,-1,-3,1,0,0,-1,-7, 0,-5,0}, 




1,0,0,0,0,0,0,0,0,0}, 
1,3,0,0,0,-2,-3,0,-4,0}, 
,-3,-2,-3,-3,-1,0,2,-2,0,-1,-2}, 
,-2,-1,0,-2,-1,0,2,-4,0,-2,-1}, 
1,0, 1,0, 0,-2,-4, 0 ; 




M,_M,_M,_M,_M,_M,_M,_M}, 



1,0,4,1,-1,-1,0,-2,-5,0,-4,3}, 
4,0,1,6,0,-1,0,-2,2,0,-4,0}, 
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include <stdio.h> 
#include <ctype.h> 

#define MAXJMP 

#define MAXGAP 

#define JMPS 

#define MX 

#deflne DMAT 

#defme DMIS 

Sdefine DINSO 

#deflne DINS] 

#defme PINSO 

#defme PINS1 

struct jmp { 

unsigned short 



/* max jumps in a diag */ 

/* don't continue to penalize gaps larger than this */ 
/* max jmps in an path */ 

/* save if there's at least MX-1 bases since last jmp */ 

I* value of matching bases */ 

/* penalty for mismatched bases */ 

/* penalty for a gap */ 

/* penalty per base */ 

/* penalty for a gap */ 

/* penalty per residue */ 



n[MAXJMPj; 
xfMAXJMP]; 



/* size of jmp (neg for dely) */ 
/* base no. of jmp in seq x */ 
/* limits seq to 2*16 -1 */ 



long 
short 
struct jmp 



offset; 
ijmp; 



/* score at last jmp */ 
/* offset of prev block */ 
/* current jmp index */ 
/* list of jmps */ 







spc; /* number of leading spaces */ 






n[JMPS];/* size of jmp (g 


ap)*/ 


}; 


int 


xfJMPS];/* loc of jmp (last elem before gap) */ 


char 




*ofile; 


/* output file name */ 


char 




*namex[2]; 


/* seq names: getseqsQ */ . 






*prog; 


/* prog name for err msgs */ 


char 




*seqx[2]; 


/* seqs: getseqsQ */ 


int 




dmax; 


/* best diag: nwO */ 


hit 




dmaxO; 


/* final diag */ 


int 




dna; 


/* set if dna: mainO */ 


int 




endgaps; 


/* set if penalizing end gaps */ 


int 




gapx, gapy; 


/* total gaps in seqs */ 


int 




lenO, lenl; 


/* seq lens */ 


int 




ngapx, ngapy; 


/* total size of gaps */ 


int 






/* max score: nw() */ 


int 




*xbm; 


/* bitmap for matching */ 






offset; 


/* current offset in jmp file */ 


struct 


diag 


*dx; 


/* holds diagonals */ 


struct 




ppP]; 


/* holds path for seqs */ 


char 




*calloc(), *malloc(), *indexQ, *strcpy(); 


char 




*getseq(), *g_callocQ; 
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FIGURE 20 c 

/* Needleman-Wunsch alignment program 

* usage: progs filel flle2 

* where filel and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with ';', '>' or '<' are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 

* Output is in the file "align.out" 

* The program may create a tmp file in /tmp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 
*/ 

#include "nw.h" 
#include "day.h" 

static _dbval[26] = { 

1,14,2,13,0,0,4,11,0,0,12,0,3,15,0,0,0,5,6,8,8,7,9,0,10,0 

}; 

static _pbval[26] = { 

1, 2|(1«( I D'-A , ))|(1«( , N , - , A , )) J 4, 8, 16, 32, 64, 
128, 256, OxFFFFFFF, 1«10, 1«12, 1«13, 1«14, 

1«15, 1«16, 1«17, 1«18, 1«19, 1«20, 1«21, 1«22, 
1«23, I«24, 1«25|(1«('E'-•A•))|(1«(•Q•- , A , )) 



char *av[]; 

prog = av[0]; 
if(ac!=3){ 

fprintfi;stderr,"usage: %s filel file2\n", prog); 

fprintf(stderr,"where filel and file2 are two dna or two protein sequencesAn"); 
fprintf(stderr,"The sequences can be in upper- or lower-case\n"); 
fprintf(stderr,"Any lines beginning with ';' or '<' are ignored\n"); 
fprintf(stderr,"Output is in the file Y'align.outW); 
exit(l); 

} 

namex[0] = av[l]; 

namex[l] = av[2]; 

seqx[0] = getseq(namex[0], &len0); 

seqx[l] = getseq(namex[l], &lenl); 

xbm = (dna)? dbval : _pbval; 



endgaps = 0; 
ofile = "align.out"; 

nw(); 

readjmps(); 
printQ; 



/* 1 to penaliz 
/* output file */ 



/* fill in the matrix, get the 
/* get the actual jmps */ 
/* print stats, alignment */ 

/* unlink any tmp files */ 
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FIGURE 20 d 

/* do the alignment, return best score: mainQ 



* dna: values in Fitch an 


d Smith, PNAS, 80, 


1382-1386,1983 


* pro: PAM 250 values 






* When scores are equal 


, we prefer mismatches to any gap, prefer 


* a new gap to extending an ongoing gap, and prefer a gap in seqx 


* to a gap in seq y. 
*/ 






nw() 
{ 






char 


*p X > *py; 


/* seqs and ptrs */ 


int 


*ndely, *dely; 


/* keep track of dely */ 


int 


ndelx, delx; 


/* keep track of delx */ 


int 


*tmp; 


/* for swapping rowO, row! */ 


int 




/* score for each type */ 


int 


insO, insl; 


/* insertion penalties *1 


register 


id; 


1* diagonal index */ 


register 


ij; 


/* jmp index */ 


register 


*co!0, *coll; 


/* score for curr, last row */ 


register 


xx, yy; 


/* index into seqs */ 



dx = (struct diag *)g_calloc("to get diags", lenO+lenl+1, sizeof(struct diag)); 

ndely = (int *)g_calloc("to get ndely", lenl+1, sizeof(int)); 
dely = (int *)g_calloc("to get dely", lenl+1, sizeof(int)); 
colO = (int *)g_calloc("to get colO", lenl+1, sizeof(int)); 
coll = (int *)g_calloc("to get coll", lenl+1, sizeofilnf)); 
ins0=(dna)?DINS0:PINS0; 
insl = (dna)? DINS 1 :PINS1; 

smax = -10000; 
if (endgaps) { 

for (co!0[0] = dely[0] = -insO, yy = 1; yy <= lenl ; yy++) { 
col0[yy] = delyfyy] = colO[yy-l] - insl ; 
ndelyfyy] = yy; 

} 

col0[0] = 0; /* Waterman Bull Math Biol 84 */ 

} 

else 

for(yy= 1; yy<=lenl; yy++) 
dely[yy] = -insO; 

/* fill in match matrix 

*/ 

for (px - seqx[0], xx = 1; xx <= lenO; px++, xx++) { 
/* initialize first entry in col 
*/ 

if (endgaps) { 

if(xx = l) 

coll [0] => delx = -(insO+insI); 

else 

col 1 [0] = delx = col0[0]- insl; 
ndelx = xx; 

} 

else { 

coll[O] = 0; 
delx = -insO; 
ndelx = 0; 

} 



Page 2 of nw.c 
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ir (py = seqxfl], yy = 1 ; yy <= lenl ; py++, yy++) { 
mis = colO[yy-l]; 
if(dna) 

mis += (xbm[*px-A']&xbm[*py- , A'])? DMAT : DMIS; 

else 

mis += _day[*px- , A , ][*py-'A']; 

/* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
*/ 

if (endgaps || ndelyfyy] < MAXGAP) { 

if (colOfyy] - insO >= dely[yy]) { 

delyfyy] = col0[yy] - (insO+insl); 
ndelyfyy] = 1; 

} else { 

delyfyy] -= ins 1; 
ndely[yy]++; 

} 

}else{ 

if (colOfyy] - (insO+insl) >= delyfyy]) { 

delyfyy] = colOfyy] - (insO+insl); 
ndelyfyy] = 1 ; 

ndely[yy]++; 

} 

/* update penalty for del in y seq; 

* favor new del over ongong del 
*/ 

if (endgaps || ndelx < MAXGAP) { 

if (coll[yy-l] -ins0>=delx) { 

delx = col 1 [yy-1] - (insO+insl); 
ndelx = 1; 

} else { 

delx-=insl; 
ndelx++; 

} 

}else{ 

if (coll[yy-l] - (insO+insl) >= delx) { 

delx = coll[yy-l] - (insO+insl); 
ndelx = 1; 

} else 

ndelx++; 



/* pick the maximum score; we're favoring 
* mis over any del and delx over dely 
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...nw 

id = xx -yy + lenl - 1; 

if (mis >= delx && mis >= dely[yy]) 

coll[yy] = mis; 
else if (delx >=dely[yy]){ 

coll[yy] = delx; 

ij = dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna ]| (ndelx >= MAXJMP 
&& xx > dx[id].jp.x[ij]+MX) || mis > dx[id].score+DINSO)) { 
dx[id].ijmp++; 
if (++ij>= MAXJMP) { 
writejmps(id); 
ij = dx[id].ijmp = 0; 
dx[id].offset = offset; 

offset += sizeof(struct jmp) + sizeof(offset); 

} 

} 

dx[id].jp.n[ij] = ndelx; 
dx[id].jp.x[ij] = xx; 
dx[id].score = delx; 

} 

else{ 

coll[yy] = dely[yy]; 
ij = dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna || (ndelyfyy] >= MAXJMP 

&& xx > dx[id].jp.x[ijj+MX) i| mis > dx[id].score+DINSO)) { 
dx[id].ijmp++; 
if(++ij>= MAXJMP) { 
writejmps(id); 
ij = dx[id].ijmp = 0; 
dx[id].offset = offset; 

offset += sizeof(struct jmp) + sizeof (offset); 

} 

} 

dx[id].jp.n[ij] = -ndelyfyy]; 
dx[id].jp.x[ij] = xx; 
dx[id].score = dely[yy]; 

} 

if (xx = lenO && yy < lenl) { 
/* last col 
*/ 

if (endgaps) 

coll [yy] -= insO+insl *(lenl -yy); 
if (coll[yy]>smax) { 

smax = coll[yy]; 

dmax = id; 

} 

} 

} 

if (endgaps && xx < lenO) 

coll [yy-1] — insO+insl *(len0-xx); 
if(coll[yy-l]>smax){ 

smax = coll [yy-1]; 

dmax = id; 

} 

tmp = colO; colO = coll ; coll = tap; 

} 

(void) free((char *)ndely); 
(void) free((char *)dely); 
(void) free((char *)col0); 

(void) free((char *)coll);} Page 4 of nw.c 
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print() - only routine visible outside this module 
static: 

getmatO — trace back best path, count matches: print() 

pr_alignO - print alignment of described in array pfj: printO 

dumpblockO - dump a block of lines with numbers, stars: pr alignO 

numsO -- put out a number line: dumpblockO 

putlineQ - put out a line (name, [num], seq, [num]): dumpblockO 

starsO - -put a line of stars: dumpblockO 

stripnameO — strip any path and prefix from a seqname 



tfdefine SPC 3 

#define P_LINE 256 /* maximum output line */ 

#define PSPC 3 /* space between name or num and seq */ 

extern _day[26][26]; 

int olen; /* set output line length */ 

FILE *fx; /* output file */ 

printO P rint 

int lx, ly, firstgap, lastgap; /* overlap */ 

if ((fx = fopen(ofile, "w")) = 0) { 

fprintf(stderr,"%s: can't write %s\n", prog, ofile); 
cleanup(l); 

} 

fprintfrfx, "<first sequence: %s (length = %d)\n", namex[0], lenO); 
fprintf(fx, "<second sequence: %s (length = %d)\n", namex[l], lenl); 
olen = 60; 
lx = lenO; 
ly = lenl; 

firstgap = lastgap = 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 

pp[0].spc = firstgap = lenl - dmax - 1; 
ly-=pp[0].spc; 

} 

else if (dmax > len 1 - 1 ) { /* leading gap in y */ 
pp[l].spc = firstgap = dmax - (lenl - 1); 
lx -= pp[l].spc; 

} 

if (dmaxO < lenO - 1 ) { /* trailing gap in x */ 
lastgap = lenO - dmaxO -1 ; 
lx -= lastgap; 

} 

else if (dmaxO > lenO - 1) { /* trailing gap m y */ 
lastgap = dmaxO - (lenO - 1); 
ly — lastgap; 

} 

getmatflx, ly, firstgap, lastgap); 
pralignO; 
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* trace back the best path, count matches 
*/ 

static 

getmat(lx, ly, firstgap, lastgap) 

lx, ly; /* "core" (minus endgaps) */ 

firstgap, lastgap; /* leading trailing overlap */ 



{ 



int nm, iO, il , sizO, sizl ; 

char outx[32]; 

double pet; 

register n0.nl; 

register char *p0, *pl; 

/* get total matches, score 



i"0 = il =siz0 = sizl =0; 
pO = seqx[0] + pp[l ].spc; 
pi = seqx[l] + pp[0].spc; 
n0 = pp[l].spc+ 1; 
nl = pp[0].spc+ 1; 

nm = 0; 

while ( *p0 && *pl ) { 
if(siz0){ 

pl++; 
nl++; 
sizO--; 



;eif(sizl){ 
p0++; 
n0++; 



if (xbm[*p0-'A']&xbm[*pl-'A']) 

if(n0++ = PP [0].x[i0]) 

siz0 = pp[0].n[i0++]; 

if(nm = pp[l].x[il]) 

sizl =pp[l].n[il++]; 

p0++; 
pl++; 



/* pet homology: 

* if penalizing endgaps, base is the shorter seq 

* else, knock off overhangs and take shorter core 

*/ 

if (endgaps) 

lx = (len0<lenl)? Ien0:lenl; 

else 

lx = (Ix<ly)?lx:ly; 
pct= 100*(double)nm/(double)lx; 

fprintf(fx, "\n"); 

fprintf(fx, "<%d match%s in an overlap of %d: %.2f percent similarity\n", 
nm, (nm = 1)? "" : "es", lx, pet); 
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fprintf(fx, "<gaps in first sequence: %d", gapx); 
if(gapx){ 

(void) sprintf(outx, " (%d %s%s)", 

ngapx, (dna)? "base":"residue", (ngapx= 1)? "":"s"); 
fprintf(fx,"%s", outx); 

fprintf(fx, ", gaps in second sequence: %d", gapy); 
if (gapy) { 

(void) sprintf(outx, " (%d %s%s)", 

ngapy, (dna)? "base": "residue", (ngapy= 1)? "":"s"); 

fprintf(fx,"%s", outx); 

} 

if (dna) 

fprintf(fx, 

"\n<score: %d (match = %d, mismatch = %d, gap penalty = %d + %d per base)\n", 
smax, DMAT, DMIS, DINSO, DINS 1 ); 

else 

fprintf(fx, 

"\n<score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per residue)\n", 
smax,PINS0, PINS1); 
if (endgaps) 

fprintf(fx, 

"<endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n", 
firstgap, (dna)? "base" : "residue", (firstgap = 1)? "" : "s", 
lastgap, (dna)? "base" : "residue", (lastgap = 1)? "" : "s"); 



else 



fprintf(fx, "<endgaps not penalizedW); 



static nm; /* matches in core — for checking */ 

static Imax; /* lengths of stripped file names */ 

static ij[2]; /* jmp index for a path */ 

static nc[2]; /* number at start of current line */ 

static ni[2]; /* current elem number - for gapping */ 

static siz[2]; 

static char *ps[2]; /* ptr to current element */ 

static char *po[2]; /* ptr to next output char slot */ 

static char out[2][P_LINE]; /* output line */ 

static char star[P_LINE]; /* set by starsQ */ 



* print alignment of described in struct path pp[] 

*/ 

static 

pr_align() 
{ 

int nn; /* char count */ 



for (i = 0, lmax = 0; i < 2; i++) { 

nn = stripname(namex[i]); 
if (nn > lmax) 

lmax = nn; 

nc[i]=l; 

ni[i]=i; 
siz[i] = ij[i] = 0; 
ps[i] = seqx[i]; 
po[i] = out[i]; 

} 
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for (nn = nm = 0, more = 1 ; more; ) { 

for (i = more = 0; i < 2; i++) { 



* do we have more of this sequence? 

*/ 

if(!*ps[i]) 

continue; 

more-H-; 

if (PPp]-sp c ) { /* leading space */ 
*po[i]++="; 
pp[i].spc~; 

} 

elseif(siz[i]){ /* in a gap*/ 
*po[i]++ = '-'; 

nzBH 

} 

else { /* we're putting a seq element 

*/ 

*po[i] = *ps[i]; 
if(islower(*ps[i])) 

*ps[i] = toupper(*ps[i]); 

po[i]++; 
ps[i]++; 



* are we at next gap for this seq? 
*/ 

if(ni[i]=pp[i].xrij[i]]){ 



} 



} 

ni[i]++; 



siz[i] = pp[i].n[ij[i]++]; 
while (ni[i] = pp[i] Jf [ij[i]]) 

siz[i]+=pp[i]. n [ij[i]++]; 



} 

if (-Him = olen || Imore && nn) { 
dumpb!ock(); 
for(i = 0;i<2;i++) 

po[i]=out[i]; 

nn = 0; 

} 



* dump a block of lines, including numbers, stars: pr_align() 
*/ 

static 

dumpbloekO 
{ 

register i; 

for (i = 0; i < 2; i++) 

*po[i]~ = '\0'; 
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(void) putc('\n', fx); 
for (i = 0; i < 2; i++) { 

if (*out[i] && (*out[i] != " |l *(po[i]) 
if(i = 0) 

nums(i); 
if(i = 0&& *out[l]) 
starsO; 

putline(i); 

if(i = 0&& *out[l]) 

fprintf(fx, star); 

if(i=J) 

nums(i); 



static 

nums(ix) 



/* index in out[] holding seq line */ 

char nline[P_LINE]; 

register i,j; 

register char *pn, *px, *py; 

for (pn = nline, i = 0; i < Imax+P_SPC; pn-H-) 
*pn = ' '; 

for (i = nc[ix], py = out[ix]; *py; py++, pn++) { 
if(*py = "ll *py = '-') 
*pn = ' '; 



if (i%10 = 0 || (i = 1 && nc[ix] != 1)) { 
j = (i<0)?-i:i; 
for (px = pn; j; j /= 10, px--) 
*px=j%10 + '0'; 

if(i<0) 

*px = 

} 

else 



} 

} 

*pn = '\0'; 
nc[ix] = i; 

for (pn = nline; *pn; pn++) 

(void) putc(*pn, fx); 
(void) putc('\n', fx); 



* put out a line (name, [num], seq, [num]): dumpblock() 



putline(ix) 
int 

{ 
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int i; 

register char *px; 

for (px = namexpx], i = 0; *px && *px ! 

(void) putc(*px, ft;); 
for (; i < lmax+P_SPC; H+) 

(void)putc(",fx); 

/* these count from 1: 

* ni[] is current element (from 1) 

* nc[] is number at start of current line 
*/ 

for (px = outfix]; *px; px++) 

(void) putc(*px&0x7F, fx); 
(void) putc('\n', fx); 



* put a line of stars (seqs always in out[0], out[l]): dumpblockQ 



static 

starsQ 



register char *p0, *pl , cx, *px; 

if (!*out[0] || (*out[0] = " && *(po[0]) = ") II 
!*out[l] || (*out[l] = 1 • && *(po[l]) = 1 ')) 
return; 

px = star; 

for (i = lmax+P_SPC; i; i~) 

*px++ = ' '; 

for (pO = out[0], pi = out[l}; *p0 && *pl ; p0++, pl++) { 
if (isalpha(*p0) && isalpha(*pl)) { 

if (xbm[*pO-'A']&xbm[*pl-'A']) { 



} 

else if (!dna && _day[*pO-'A'][*pl-'A'] > 0) 



*px++ = V; 
*px = '\0'; 
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* strip path or prefix from pn, return Ien: pr_align() 
*/ 

static 

stripname(pn) stripname 
char *pn; /* file name (may be path) */ 

{ 

register char *px, *py; 
py=0; 

for (px = pn; *px; px++) 
if(*px = 7') 

py = px + l; 

if(py) 

(void) strcpy(pn, py); 
return(strlen(pn)); 

} 
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* cleanupO — cleanup any tmp file 

* getseqO - read in seq, set dna, len, maxlen 

* g_calloc() - callocO with error checkin 

* readjmpsO - get the good jmps, from tmp file if necessary 

* writejmpsO — write a filled array of jmps to a trap file: nwQ 



#include "nw.h" 
#indude <sys/file.h> 



long 



*jname = 7tmp/homgXXXXXX"; 

*fj; 



cleanupO; 
lseekQ; 



/* tmp file for jmps */ 
/* cleanup tmp file */ 



* remove any tmp file if we blow 

*/ 

cleanup(i) 
{ 

if(fj) 

(void) unlink(jname); 



} 



int i; 
if(fj) 
exit(i); 



* read, return ptr to seq, set dna, len, maxlen 

* skip lines starting with ';*, '<', or '>' 

* seq in upper or lower case 
*/ 

char * 

getseq(file, len) 

char *file; /* file name */ 



/*; 



q len */ 



char 

register char 



line[1024], *pseq; 
*px, *py; 
natgc, tlen; 

*fp; 



if((fp = fo P en(file,V)) = 0){ 

fprintf(stderr,"%s: can't read %s\n", prog, file); 
exit(l); 

} 

tlen = natgc = 0; 

while (fgets(line, 1024, fp)) { 

if (*line = ';' || *line = *<' || *line = '>') 

continue; 
for (px = line; *px != V; px++) 

if (isupper(*px) || islower(*px)) 
tlen++; 

} 

if ((pseq = malloc((unsigned)(tlen+6))) = 0) { 

fprintf(stderr,"%s: malloc() failed to get %d bytes for %s\n", prog, tlen+6, file); 
exit(l); 

} 

pseq[0] = pseqfl] - pseq[2] = pseq[3] = W; 

Page 1 of nwsubr.c 



47/50 



WO 02/00711 



PCT/US01/20116 



FIGURE 20 o 



py = pseq + 4; 
rewind(fp); 

while (fgets(line, 1024, fp)) { 

if (*line = ';' || *line = '<' || *line = '>') 

continue; 
for (px = line; *px != V; px++) { 
if (isupper(*px)) 

*py++ = *px; 
else if (islower(*px)) 

*py-H- = toupper(*px); 
if(index("ATGCU",*(py-l))) 
natgc++; 

} 

} 

*py-H- = '\0'; 
*py = , \0'; 
(void) fclose(fp); 
dna = natgc > (tlen/3); 
return(pseq+4); 



char * 
g_calloc(msg, nx, sz) 

char *msg; /* program, calling routine */ 

int nx, sz; /* number and size of elements */ 

{ 

char *px, *calloc(); 

if ((px = calloc((unsigned)nx, (unsigned)sz)) — 0) { 
if(*msg){ 

fprintf(stderr, "%s: g_calloc() failed %s (n=%d, sz=%d)\n", prog, msg, nx, sz); 
exit(l); 

} 

} 

return(px); 



* get final jmps from dx[] or tmp file, set ppO, reset dmax: mainQ 



readjmpsO 
{ 



fd = -l; 
siz, i0.il; 



(void) fclose(fj); 

if ((fd = open(jname, OJRDONLY, 0)) < 0) { 

fprintf(stdeTT, "%s: can't openO %s\n", prog, jname); 
cleamip(l); 

} 

} 

for (i = iO = i 1 = 0, dmaxO = dmax, xx = lenO; ; i++) { 
while (1){ 

for (j = dx[dmax].ijmp; j >= 0 && dx[dmax].jp.xD] >= 
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...readjmps 

if 0 < 0 && dx[dmax].offset && fj) { 

(void) lseek(fd, dxfdmaxj.offset, 0); 

(void) read(fd, (char *)&dx[dmax].jp, sizeof(struct jmp)); 

(void) read(fd, (char *)&dx[dmax].offset, sizeof(dx[dmax].offset)); 

dx[dmax].ijmp = MAXJMP-1 ; 

} 

else 

break; 

} 

if(i>=JMPS){ 

fprintf(stderr, "%s: too many gaps in alignments", prog); 
cleanup(l); 

} 

if(j>=0){ 

siz = dx[dmax].jp.n[j]; 
xx = dx[dmax].jp.x[j]; 
dmax += siz; 

if (siz < 0) { /* gap in second seq */ 

pp[l].n[il] = -siz; 

/* id = xx - yy + lenl - 1 
*/ 

pp[l].x[il] = xx- dmax + lenl -1; 
gapy++; 
ngapy -= siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (-siz < MAXGAP || endgaps)? -siz : MAXGAP; 
il++; 

} 

else if (siz > 0) { /* gap in first seq */ 
P p[0].n[i0] = siz; 
pp[0].x[i0] = xx; 
gapx++; 
ngapx += siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP || endgaps)? siz : MAXGAP; 
i0++; 

} 

} 

else 

break; 

} 

/* reverse the order of jmps 

*/ 

for 0 = 0, i0~; j < iO; i0~) { 

i = pp[0].nD]; p P [0].n[j] = P p[0].n[i0]; PP [0].n[i0] = i; 
i = pp[0].xD]; pp[0].x[j] = PP [0].x[i0]; PP [0].x[i0] = i; 

} 

for0 = 0,il-;j<il;j++,il-){ 

i = pp[l].nU]; pp[l]-nD'] = PP[l].n[il]; PP[l] n[il] = i; 
i = PP [l].x|j]; p P [l].x[j] = pp[l].x[il J; PPM-xpl] = fe 

} 

if(fd>=0) 

(void) close(fd); 

if(S){ 

(void) unlinkOname); 
fj = 0; 
offset =0; 
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/* 

* write a filled jmp struct offset of the prev one (if any): nw() 

writejmpsCix) writejmps 
int ix; 

{ 

char *mktemp0; 
if(!fi){ 

if (mktemp(jname) < 0) { 

fprintf(stderr, "%s: can't mktempO %s\n", prog, jname); 
cleanup(l); 

} 

if ((fj = fopenGname, ' V)) = 0) { 

fprintf(stderr, "%s: can't write %s\n", prog, jname); 
exit(l); 

} 

(void) fwrite((char *)&dx[ix].jp, sizcof(struct jmp), 1, fj); 
(void) fwrite((char *)&dx[ix].offset, sizeof(dx[ix].offset), 1, fj); 

} 
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